HUMAN MONOCLONAL ANTIBODY

Field of the Invention 

This invention relates to novel human monoclonal
antibodies (mAbs) and to the genes encoding same. More
specifically, this invention relates to human monoclonal
antibodies specifically reactive with an epitope of the
fusion (F) protein of Respiratory Syncytial Virus (RSV).
Such antibodies are useful for the therapeutic and/or
prophylactic treatment of RSV infection in human
patients, particularly infants and young children.

Background of the Invention 

Respiratory syncytial virus (RSV) is the major
cause of lower respiratory disease in children, giving
rise to predictable annual epidemics of bronchiolitis
and pneumonia in children worldwide. The virus is
highly contagious, and infections can occur at any age.
Comprehensive details concerning RSV infection and its
clinical features can be obtained from excellent recent
reviews by McIntosh, K. and R. M. Chanock, In:
"Respiratory Syncytial Virus", Ch. 38, B.N. Fields ed.,
Raven Press (1990) and Hall, C.B., In: "Textbook of
Pediatric Disease" Feigin and Cherry, eds., W.B.
Saunders, pgs 1247-1268 (1987).

RSV is distributed worldwide. One of the most
remarkable features of the epidemiology of RSV virus, as
mentioned above, is the consistent pattern of infection
and disease. Other respiratory viruses cause epidemics
at irregular intervals or exhibit a mixed
endemic/epidemic pattern, but RSV is the only
respiratory viral pathogen that produces a sizable
epidemic every year in large urban centers. In the
temperate areas of the world, RSV epidemics have
occurred primarily in the late fall, winter or spring
but never during the summer. The occurrence and spread
of infection within a community is characteristic and
easily diagnosed, leading to sharp rises in cases of
bronchiolitis and pediatric pneumonia and the number of
hospital admissions of young children with acute lower
respiratory tract disease. Other respiratory viral
agents that occur in outbreaks are rarely present at the
same time as RSV.

Primary RSV infection occurs in the very young.
Zero to 2 year old infants are the most susceptible and
represent the primary affected population. In this
group, 1 out of 5 will develop lower respiratory (below
larynx) disease upon infection and this ratio stays the
same upon reinfection. By 1 year of age, 25-50% of
infants have specific antibodies as a result of natural
infection and this is close to 100% by age 4-5. Thus,
virtually all children have been infected before they
have entered school.

Age, sex, socioeconomic and environmental factors
can all influence the severity of disease.
Hospitalization is required in 1-3% of cases of RSV
infection and is usually of long duration (up to 3
weeks). The high morbidity of RSV infection, especially
in infancy, has also been implicated in the development
of respiratory problems later in life. With current
intensive care in the U.S. and the other developed
countries, overall mortality for normal subjects is low
(less than 2% of hospitalized subjects). However,
mortality is much higher in less developed countries
and, even in developed countries, mortality is high in
certain risk groups such as in infants with underlying
cardiac condition (cyanotic congenital heart disease) or
respiratory disease (bronchopulmonary dysplasia) where
the progression of symptoms may be rapid. For instance,
mortality in infants with cyanotic congenital heart
disease has been reported to be as high as 37%. In
premature infants apneic spells due to RSV infection may
occur and, in rare cases, cause neurologic or systemic
damage. Severe lower respiratory tract illness
(bronchiolitis and pneumonia) is most common in patients
under six months of age. Infants who have apparently
recovered completely from this illness may display
symptomatic respiratory abnormalities for years
(recurrent wheezing, decreased pulmonary function,
recurrent cough, asthma, and bronchitis).

Immunity to RSV appears to be short-lived; thus
reinfections are frequent. The mechanisms by which the
immune system protects against RSV infection and
reinfection are not well understood. It is clear,
however, that immunity is only partially protective
since reinfection is common at all ages, and sometimes
occurs in infants only weeks after recovery from a
primary infection. Both serum and secretory antibodies
(IgA) have been detected in response to RSV infection in
adults as well as in very young infants. However, the
titers of serum antibodies to the viral F or G
glycoprotein, as well as of neutralizing antibodies,
found in infants (1-8 months of age) are 15-25% of those
found in older subjects. These reduced titers may
contribute to the increased incidence of serious
infection in younger children.

Evidence for the role of serum antibodies in
protection against RSV virus has emerged from
epidemiological as well as animal studies. In adults
exposed naturally to the virus, susceptibility
correlated well with low serum antibody level. In
infants, titers of maternally transmitted antibodies
correlate with resistance to serious disease [Glezen,
W.P. et al., J. Pediatr. 98:708-715 (1981)]. Other
studies show that the incidence and severity of lower
respiratory tract involvement is diminished in the
presence of high serum antibody [McIntosh, K. et al., J.
Infect. Dis. 138:24-32 (1978)] and high titers of
passively administered serum neutralizing antibodies
have been shown to be protective in a cotton rat model
of RSV infection [Prince, G. A. et al., Virus Res.
3:193-206 (1985)].

Children lacking cell-mediated immunity are unable
to overcome their infection and shed virus for many
months in contrast to children with normal immune
systems. Similarly, nude mice infected with RSV virus
persistently shed virus. These mice can be cured by
adoptive transfer of primed T cells [Cannon, M. J. et
al., Immunology 62:133-138 (1987)].

In summary, it appears that both cellular and
humoral immunity are involved in protection against
infection, reinfection and RSV disease and that although
antigenic variation is limited, protective immunity is
not complete even after multiple exposures.

RSV, belonging to the family Paramyxoviridae, is a
negative-strand unsegmented RNA virus with properties
similar to those of the paramyxoviruses. It has,
however, been placed in a separate genus, Pneumovirus,
based on morphologic differences and lack of
hemagglutinin and neuraminidase activities. RSV is
pleomorphic and ranges in size from 150-300 nm in
diameter. The virus matures by budding from the outer
membrane of a cell and virions appear as membrane-bound
particles with short, closely spaced projections or
"spikes". The RNA genome encodes 10 unique viral
polypeptides ranging in size from 9.5 kDa to 160 kDa
[Huang, Y. T. and G. W. Wertz, J. Virol. 43:150-157
(1982)]. Seven proteins (F, G, N, P, L, M, M2) are
present in RSV virions and at least three proteins (F,
G, and SH) are expressed on the surface of infected
cells. The F protein [SEQ ID NO: 20] has been
conclusively identified as the protein responsible for
cell fusion since specific antibodies to this protein
inhibit syncytia formation in vitro and cells infected
with vaccinia virus expressing recombinant F protein
form syncytia in the absence of other RSV virus
proteins. In contrast, antibodies to the G protein do
not block syncytia formation but prevent attachment of
the virus to cells.

RSV can be divided into two antigenically distinct
subgroups (A & B) [Mufson, M. A. et al., J. Gen'l.
Virol. 66:2111-2124 (1985)]. This antigenic dimorphism
is linked primarily to the surface attachment (G)
glycoprotein [Johnson, R. A. et al., Proc. Nat'l. Acad.
Sci. USA 84:5625-5629 (1987)]. Strains of both group A
and B circulate simultaneously, but the proportion of
each may vary unpredictably from year to year. An
effective therapy must therefore target both subgroups
of the virus and this is the reason for the selection of
the highly conserved surface fusion (F) protein as
target antigen for mAb therapy as will be discussed
later.

The induction of neutralizing antibodies to RSV
virus appears to be limited to the F and G surface
glycoproteins. Of these two proteins, the F protein is
the major target for cross-reactive neutralizing
antibodies associated with protection against different
strains of RSV virus. In addition, experimental
vaccination of mice or cotton rats with F protein also
results in cross protection. The antigenic relatedness
of the F protein across strains and subgroups of the
virus is reflected in its high degree of homology at the
amino acid level. In contrast, in the two subgroups and
various strains of RSV, antigenic dimorphism was linked
primarily to the G glycoprotein. The F protein has a
predicted molecular weight of 68-70 kDa; a signal
peptide at its N-terminus; a membrane anchor domain at
its C-terminus; and is cleaved proteolytically in the
infected cell prior to virion assembly to yield
disulfide linked F2 and F1. Five neutralizing epitopes
have been identified within the F protein sequence [SEQ
ID NO: 20] and map to residues 205-225; 259-278;
289-299; 483-488 and 417-438. Studies to determine the
frequency of sequence divergence in the F protein [SEQ ID
NO: 20] showed that the majority of the neutralizing
epitopes were conserved in all of the 23 strains of RSV
virus isolated in Australia, Europe, and regions of the
U.S. over a period of thirty years. In another study,
seroresponses of forty three infants and young children
to primary infection with a subgroup A or a subgroup B
strain showed that responses to homologous and
heterologous F antigens were not significantly
different, while the G proteins of the subgroup A and B
strains were quite unrelated. Moreover, antibody
inhibition of virus-mediated cell fusion in vitro, rather
than inhibition of infection, correlates best with
protection in animal models, and fusion inhibition is
primarily restricted to F protein specific antibodies.
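The epitope conservation comparison described above can be illustrated with a minimal Python sketch. It assumes two F protein sequences that share the same residue numbering (no insertions or deletions) and reports, for each of the five residue ranges named in the text, whether the two sequences are identical across that range. The function names and the toy sequences are hypothetical placeholders; the toy strings are not SEQ ID NO: 20 or any real RSV sequence.

```python
# Residue ranges (1-based, inclusive) of the five neutralizing epitopes
# named in the text above.
EPITOPE_RANGES = [(205, 225), (259, 278), (289, 299), (417, 438), (483, 488)]


def epitope_segments(protein, ranges=EPITOPE_RANGES):
    """Return {(start, end): subsequence} for each 1-based inclusive range."""
    segments = {}
    for start, end in ranges:
        if start < 1 or end > len(protein):
            raise ValueError(f"range {start}-{end} falls outside the sequence")
        # Convert 1-based inclusive residue numbers to a Python slice.
        segments[(start, end)] = protein[start - 1:end]
    return segments


def conserved_epitopes(reference, other, ranges=EPITOPE_RANGES):
    """Map each range to True when both sequences are identical across it."""
    ref = epitope_segments(reference, ranges)
    alt = epitope_segments(other, ranges)
    return {r: ref[r] == alt[r] for r in ranges}


# Hypothetical placeholder sequences (assumption, for illustration only):
# a 600-residue toy string and a variant with one substitution at residue 210.
toy_reference = "ACDEFGHIKLMNPQRSTVWY" * 30
toy_variant = toy_reference[:209] + "X" + toy_reference[210:]
result = conserved_epitopes(toy_reference, toy_variant)
```

In this toy case only the 205-225 range is flagged as not conserved, because the single substitution falls inside it. Real strain comparisons would first require a sequence alignment, which this sketch does not attempt.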

Prophylactic treatment for RSV infection is thus
desirable for the high risk groups of children as well
as for all children in underdeveloped countries.
However, a vaccine for RSV infection is not currently
available. Severe safety issues surrounding an
attenuated whole virus vaccine tested in the 1960s, as
well as the potential of induced immunopathology
associated with the newer candidate subunit vaccines
make the prospects of a vaccine in the near future
appear remote. To date one drug therapy, Ribavirin, a
broad spectrum antiviral, has been approved. Ribavirin
has gained only minimal acceptance owing to problems of
administration, mild toxicity and questionable efficacy.
In the majority of cases, hospitalized children receive
no drug therapy and receive only intensive supportive
care which is extremely costly. It is clear that there
is a need for a safe, effective and easily administered
drug for the treatment of RSV infection.

The use of passive antibody therapy in humans is
well documented and is being used to treat other
infectious diseases such as hepatitis and
cytomegalovirus. The feasibility of passive antibody
treatment/protection against RSV has been well
established using animal models. Most of the earlier
passive transfer studies in animals against infectious
agents, including RSV, utilized murine mAbs. Studies in
animals have clearly demonstrated that polyclonal and
monoclonal antibodies against both the F and G
glycoproteins can confer passive protection in RSV virus
infection when given prophylactically or therapeutically
[Prince, et al., supra]. In these studies, passive
transfer of neutralizing F or G mAbs to mice, cotton rats
or monkeys significantly reduced or completely prevented
replication of the RSV virus in the lungs. However, as
discussed above, clearly, the F protein is the more
important target for antibody therapy.

Recently, the FDA has approved for use intravenous
gammaglobulins (IVIG) isolated from pooled human sera.
Initial reports from this study had been encouraging
[Groothuis, J. R. et al., Antimicrob. Agents Chemo.
35(7):1469-1473 (1991)]. However, generic shortcomings
of IVIGs exist and include, without limitation, the fact
that such products are human blood derived and grams of
antibody often need to be administered to achieve an
effective dose.

Alternatively, monoclonal antibodies have been
employed. The advantages of such an approach include: a
higher concentration of specific antibody can be
achieved thereby reducing the amount of globulin
required to be given; the reliance on direct blood
products can be eliminated; the levels of antibody in
the preparation can be more uniformly controlled and the
routes of administration can be extended. While passive
immunotherapy employing monoclonal antibodies from a
heterologous species (e.g., murine) has been suggested
(See: PCT Application PCT/US94/08699, Publication No. WO
95/04081), one alternative to reduce the risk of an
undesirable immune response on the part of the patient
directed against the foreign antibody is to employ
"humanized" antibodies. These antibodies are
substantially of human origin, with only the
Complementarity Determining Regions (CDRs) being of
non-human origin. Particularly useful examples of this
approach are disclosed in PCT Application
PCT/GB91/01554, Publication No. WO 92/04381 and PCT
Application PCT/GB93/00725, Publication No. WO93/20210.
Clinical trials are on-going to evaluate the efficacy of
humanized antibodies for treatment of RSV infection in
young children.

A second and more preferred approach is to employ
fully human mAbs. Unfortunately, there have been few
successes in producing human monoclonal antibodies
through classic hybridoma technology. Indeed,
acceptable human fusion partners have not been
identified and murine myeloma fusion partners do not
work well with human cells, yielding unstable and low
producing hybridoma lines. However, recent advances in
molecular biology and immunology make it now possible to
isolate human mAbs, particularly directed against
foreign infectious agents.

Fully human mAbs to RSV F protein [SEQ ID NO: 20]
remain a desirable option for the treatment of this
disease. Although some success has been reported in
obtaining fragments of such mAbs [Barbas, C.F. et al.,
Proc. Nat'l. Acad. Sci. USA 89:10164-10168 (1992);
Crowe, J. E. et al., Proc. Nat'l. Acad. Sci. USA
91:1386-1390 (1994) and PCT application number
PCT/US93/08786, published as WO94/06448, March 31,
1994], the achievement of such results is not
straightforward. Novel human mAbs, when and however
obtained, are particularly useful alone or in
combination with existing molecules to form
immunotherapeutic compositions.

There exists a need in the art for useful 
prophylactic compositions for the prevention or passive 
treatment of RSV. 

Brief Description of the Invention

In one aspect, this invention provides fully human
monoclonal antibodies and functional fragments thereof
specifically reactive with an F protein epitope of RSV
and capable of neutralizing RSV infection. These human
mAbs specific for the F protein of RSV virus may be
useful to passively treat or prevent infection.

In another aspect, the present invention provides
modifications to neutralizing single chain Fv fragments
(scFv) specific for the F protein of RSV produced by
random combinatorial cloning of human antibody sequences
and isolated from a filamentous phage Fab display
library.

In still another aspect, there is provided a
reshaped or altered human antibody containing human
heavy and light chain constant regions from a first
human donor and heavy and light chain variable regions
or the CDRs thereof derived from human neutralizing
monoclonal antibodies for the F protein of RSV derived
from a second human donor.

In yet another aspect, the present invention 
provides a pharmaceutical composition which contains one 
(or more) altered or reshaped antibodies and a 
pharmaceutically acceptable carrier. 

In yet another aspect, the invention provides a
pharmaceutical composition comprising at least one dose
of an immunotherapeutically effective amount of the
reshaped, altered or monoclonal antibody of this
invention in combination with at least one additional
monoclonal, altered or reshaped antibody. A particular
embodiment is provided in which the additional antibody
is an anti-RSV antibody distinguished from the subject
antibody of the invention by virtue of being reactive
with a different epitope of the RSV F protein antigen
than the subject antibody of the invention.

In a further aspect, the present invention provides
a method for passive immunotherapy of RSV disease in a
human by administering to said human an effective amount
of the pharmaceutical composition of the invention for
the prophylactic or therapeutic treatment of RSV
infection.

In yet another aspect, the present invention
provides methods for, and components useful in, the
recombinant production of human and altered antibodies
(e.g., engineered antibodies, CDRs, Fab or F(ab')2
fragments, or analogs thereof) which are derived from
human neutralizing monoclonal antibodies (mAbs) for the
F protein of RSV. These components include isolated
nucleic acid sequences encoding same, recombinant
plasmids containing the nucleic acid sequences under the
control of selected regulatory sequences which are
capable of directing the expression thereof in host
cells (preferably mammalian) transfected with the
recombinant plasmids. The production method involves
culturing a transfected host cell line of the present
invention under conditions such that the human or
altered antibody is expressed in said cells and
isolating the expressed product therefrom.

Still another aspect of the invention is a
method to diagnose the presence of RSV in a human which
comprises contacting a sample of biological fluid with
the human antibodies and altered antibodies and
fragments thereof of the instant invention and assaying
for the occurrence of binding between said human
antibody (or altered antibody, or fragment) and RSV.

Other aspects and advantages of the present 
invention are described further in the detailed 
description and the preferred embodiments thereof. 

Brief Description of the Drawings 

Fig. 1A is a graph illustrating the competition of
Gk-1 scFv phage binding with RSV19 mAb [International
patent publication No. WO92/04381, published March 19,
1992].

Fig. 1B is a graph illustrating the competition of
Gλ-1 scFv phage binding with RSV B4 mAb [International
patent publication No. WO93/20210, published October 14,
1993].
Fig. 2 is a graph illustrating virus neutralization
by scFv phages, Gλ-1, Gλ-3, and GK-1 with RSV strain 273.

Fig. 3 illustrates the DNA sequence [SEQ ID NO: 1]
and protein sequence (amino acids reported in single
letter code) [SEQ ID NO: 2] for the Gλ-1 light chain
variable region, processed N-terminus through framework
IV.

Fig. 4 illustrates the DNA sequence [SEQ ID NO: 3]
and protein sequence (amino acids reported in single
letter code) [SEQ ID NO: 4] for the Gλ-1 heavy chain
variable region, processed N-terminus through framework
IV.

Fig. 5 illustrates the cloning strategy used for
the construction of the Gλ-1 monoclonal antibody. The
heavy chain V region was cloned into the pCD derivative
vector as an XhoI - ApaI fragment. The entire light
chain V region was cloned into the pCN derivative
vector, 43-lpcn, as a SacI - AvrII fragment. Details
are described below.

Fig. 6 provides a comparison of the heavy chain
amino acid sequences of the Gλ-1 single chain Fv [SEQ ID
NO: 5] and various monoclonal antibodies of this
invention. The amino acid sequences of the heavy chains
for the A [SEQ ID NO: 7] and B [SEQ ID NO: 8] constructs
are shown. Numbering of the residues is based on the
germline (GL) gene Dp58 [SEQ ID NO: 6], beginning at the
mature processed amino terminus and ending at CDR3. The
"-" indicates identity to the preceding sequence (e.g., A
compared to B). Bold residues correspond to the leader
region, and to CDRs 1-3.

Fig. 7 provides a comparison of the light chain
amino acid sequences of the Gλ-1A single chain Fv [SEQ ID
NO: 9] and various monoclonal antibodies of this
invention. The amino acid sequences of the light chains
for the A [SEQ ID NO: 11] and B [SEQ ID NO: 12]
constructs are shown. Numbering of the residues in the
Vk region is based on the germline (GL) gene DpL8 [SEQ
ID NO: 10], beginning at the mature processed amino
terminus and ending at CDR3. For reference to framework
4, the actual numbering is also shown for Gλ-1A. As in
Fig. 6, the "-" indicates identity to the preceding
sequence.

Figs. 8A to 8F illustrate the continuous DNA
sequence [SEQ ID NO: 13] of the expression plasmid
Gλ-1Apcd containing the RSV neutralizing human Gλ-1 mAb for
the heavy chain. The start of translation, leader
peptide, amino-terminal processing site, carboxy
terminus of the Gλ-1 heavy chain, and EcoRI restriction
endonuclease cleavage site are shown.

Figs. 9A to 9E illustrate the continuous DNA
sequence [SEQ ID NO: 14] of the expression plasmid
Gλ-1Apcn containing the RSV neutralizing human Gλ-1 mAb for
the light chain. The corresponding features for the
light chain as for Figs. 8A-8F are shown.

Figs. 10A and 10B illustrate the continuous DNA
sequence [SEQ ID NO: 15] of the coding region of the
heavy chain of plasmid Gλ-1Bpcd. Bolded residues
indicate differences from the full vector sequence for
Gλ-1Apcd in Figs. 8A-8F [SEQ ID NO: 13].

Fig. 11 is the DNA sequence [SEQ ID NO: 16] of the
coding region for the light chain of plasmid Gλ-1Bpcn.
Bolded residues indicate differences from the full
vector sequence for Gλ-1Apcn in Figs. 9A-9E [SEQ ID NO:
14].
Detailed Description of the Invention

This invention provides useful human monoclonal
antibodies (and fragments thereof) reactive with the F
protein of RSV, isolated nucleic acids encoding same and
various means for their recombinant production as well
as therapeutic, prophylactic and diagnostic uses of such
antibodies and fragments thereof.

I. Definitions.

As used in this specification and the claims, the
following terms are defined as follows:

"Altered antibody" refers to a protein encoded by 
an altered immunoglobulin coding region, which may be
obtained by expression in a selected host cell. Such
altered antibodies are engineered antibodies (e.g.,
chimeric, humanized, or reshaped or immunologically
edited human antibodies) or fragments thereof lacking
all or part of an immunoglobulin constant region, e.g.,
Fv, Fab, or F(ab')2 and the like.

"Altered immunoglobulin coding region" refers to a
nucleic acid sequence encoding an altered antibody of
the invention or a fragment thereof.

"Reshaped human antibody" refers to an altered 
antibody in which at least one CDR from a
first human monoclonal donor antibody is substituted for
a CDR in a second human acceptor antibody. Preferably
all six CDRs are replaced. More preferably an entire
antigen combining region (e.g., Fv, Fab or F(ab')2) from
a first human donor monoclonal antibody is substituted
for the corresponding region in a second human acceptor
monoclonal antibody. Most preferably the Fab region
from a first human donor is operatively linked to the
appropriate constant regions of a second human acceptor
antibody to form a full length monoclonal antibody.
"First immunoglobulin partner" refers to a nucleic
acid sequence encoding a human framework or human
immunoglobulin variable region in which the native (or
naturally-occurring) CDR-encoding regions are replaced
by the CDR-encoding regions of a donor human antibody.
The human variable region can be an immunoglobulin heavy
chain, a light chain (or both chains), an analog or
functional fragments thereof. Such CDR regions, located
within the variable region of antibodies
(immunoglobulins) can be determined by known methods in
the art. For example, Kabat et al. (Sequences of
Proteins of Immunological Interest, 4th Ed., U.S.
Department of Health and Human Services, National
Institutes of Health (1987)) disclose rules for locating
CDRs. In addition, computer programs are known which
are useful for identifying CDR regions/structures.
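The replacement of native CDR-encoding regions by donor CDR-encoding regions can be sketched at the sequence level as follows. This is a minimal Python illustration, assuming the CDR boundaries of both the acceptor and the donor variable regions have already been located (for example by the Kabat rules cited above) and are supplied as 0-based, half-open intervals. The function name graft_cdrs and the toy strings are hypothetical and are not taken from any sequence of this specification.

```python
def graft_cdrs(acceptor, acceptor_cdrs, donor, donor_cdrs):
    """Return the acceptor sequence with each CDR swapped for the donor's.

    acceptor_cdrs and donor_cdrs are lists of (start, end) intervals,
    0-based and half-open, given in order along each sequence.
    """
    if len(acceptor_cdrs) != len(donor_cdrs):
        raise ValueError("acceptor and donor must list the same number of CDRs")
    pieces = []
    position = 0
    for (a_start, a_end), (d_start, d_end) in zip(acceptor_cdrs, donor_cdrs):
        pieces.append(acceptor[position:a_start])  # acceptor framework segment
        pieces.append(donor[d_start:d_end])        # donor CDR replaces native CDR
        position = a_end
    pieces.append(acceptor[position:])             # trailing acceptor framework
    return "".join(pieces)


# Toy strings (assumption, for illustration only): lowercase letters stand for
# framework, uppercase letters stand for CDRs.
acceptor = "aaaaXXXbbbbYYYYccccZZdddd"
acceptor_cdrs = [(4, 7), (11, 15), (19, 21)]
donor = "eeeeeLLLLfffMMgggggNNNhh"
donor_cdrs = [(5, 9), (12, 14), (19, 22)]
grafted = graft_cdrs(acceptor, acceptor_cdrs, donor, donor_cdrs)
```

The same slicing applies whether the strings hold nucleotides or amino acids. For nucleotide sequences, the supplied boundaries would need to fall on codon boundaries so that the reading frame is preserved.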

"Second fusion partner" refers to another 
nucleotide sequence encoding a protein or peptide to
which the first immunoglobulin partner is fused in frame
or by means of an optional conventional linker sequence
(i.e., operatively linked). Preferably the fusion
partner is an immunoglobulin gene and when so, it is
referred to as a "second immunoglobulin partner". The
second immunoglobulin partner may include a nucleic acid
sequence encoding the entire constant region for the
same (i.e., homologous - the first and second altered
antibodies are derived from the same source) or an
additional (i.e., heterologous) antibody of interest.
It may be an immunoglobulin heavy chain or light chain
(or both chains as part of a single polypeptide). The
second immunoglobulin partner is not limited to a
particular immunoglobulin class or isotype. In
addition, the second immunoglobulin partner may comprise
part of an immunoglobulin constant region, such as found
in a Fab, or F(ab')2 (i.e., a discrete part of an
appropriate human constant region or framework region).

A second fusion partner may also comprise a sequence
encoding an integral membrane protein exposed on the
outer surface of a host cell, e.g., as part of a phage
display library, or a sequence encoding a protein for
analytical or diagnostic detection, e.g., horseradish
peroxidase (HRP), β-galactosidase, etc.

The terms Fv, Fc, Fd, Fab, or F(ab')2 are used with
their standard meanings [see, e.g., Harlow et al.,
Antibodies: A Laboratory Manual, Cold Spring Harbor
Laboratory, (1988)].

As used herein, an "engineered antibody" describes
a type of altered antibody, i.e., a full-length
synthetic antibody (e.g., a chimeric, humanized,
reshaped or immunologically edited human antibody as
opposed to an antibody fragment) in which a portion of
the light and/or heavy chain variable domains of a
selected acceptor antibody are replaced by analogous
parts from one or more donor antibodies which have
specificity for the selected epitope. For example, such
molecules may include antibodies characterized by a
humanized heavy chain associated with an unmodified
light chain (or chimeric light chain), or vice versa.
Engineered antibodies may also be characterized by
alteration of the nucleic acid sequences encoding the
acceptor antibody light and/or heavy variable domain
framework regions in order to retain donor antibody
binding specificity. These antibodies can comprise
replacement of one or more CDRs (preferably all) from
the acceptor antibody with CDRs from a donor antibody
described herein.

A "chimeric antibody" refers to a type of 
engineered antibody which contains naturally-occurring
variable regions (light chain and heavy chain) derived
from a donor antibody in association with light and
heavy chain constant regions derived from an acceptor
antibody from a heterologous species.

A "humanized antibody" refers to a type of
engineered antibody having its CDRs derived from a
non-human donor immunoglobulin, the remaining
immunoglobulin-derived parts of the molecule being
derived from one (or more) human immunoglobulin(s). In
addition, framework support residues may be altered to
preserve binding affinity [see, e.g., Queen et al.,
Proc. Nat'l. Acad. Sci. USA, 86:10029-10032 (1989),
Hodgson et al., Bio/Technology, 9:421 (1991)].

An "immunologically edited antibody" refers to a
type of engineered antibody in which changes are made in
donor and/or acceptor sequences to edit regions in
respect of cloning artifacts, germ line enhancements,
etc. aimed at reducing the likelihood of an
immunological response to the antibody on the part of a
patient being treated with the edited antibody.

The term "donor antibody" refers to an antibody
(monoclonal, or recombinant) which contributes the
nucleic acid sequences of its variable regions, CDRs, or
other functional fragments or analogs thereof to a first
immunoglobulin partner, so as to provide the altered
immunoglobulin coding region and resulting expressed
altered antibody with the antigenic specificity and
neutralizing activity characteristic of the donor
antibody. One donor antibody suitable for use in this
invention is a Fab fragment of a human neutralizing
monoclonal antibody designated as Fab Gλ-1. Fab Gλ-1 is
defined as having the variable light and heavy chain
DNA and amino acid sequences of Gλ-1 as shown in Figs. 3,
4, 8A-8F and 9A-9E [SEQ ID NOS: 1-4, 13 and 14].
The term "acceptor antibody" refers to an antibody
(monoclonal or recombinant) from a source genetically
unrelated to the donor antibody, which contributes all
(or any portion, but preferably all) of the nucleic acid
sequences encoding its heavy and/or light chain
framework regions and/or its heavy and/or light chain
constant regions to the first immunoglobulin partner.
Preferably a human antibody is the acceptor antibody.

"CDRs" are defined as the complementarity
determining region amino acid sequences of an antibody
which are the hypervariable regions of immunoglobulin
heavy and light chains [see, e.g., Kabat et al.,
Sequences of Proteins of Immunological Interest, 4th
Ed., U.S. Department of Health and Human Services,
National Institutes of Health (1987)]. There are three
heavy chain and three light chain CDRs (or CDR regions)
in the variable portion of an immunoglobulin. Thus,
"CDRs" as used herein refers to all three heavy chain
CDRs, or all three light chain CDRs (or both all heavy
and all light chain CDRs, if appropriate). CDRs provide
the majority of contact residues for the binding of the
antibody to the antigen or epitope. CDRs of interest in
this invention are derived from donor antibody variable
heavy and light chain sequences, and include analogs of
the naturally occurring CDRs, which analogs also share
or retain the same antigen binding specificity and/or
neutralizing ability as the donor antibody from which
they were derived.

By "sharing the antigen binding specificity or
neutralizing ability" is meant, for example, that
although Fab GX-1 may be characterized by a certain
level of antigen affinity, a CDR encoded by a nucleic
acid sequence of Fab GX-1 in an appropriate structural
environment may have a lower or higher affinity. It is
expected that CDRs of Fab GX-1 in such environments will
nevertheless recognize the same epitope(s) as does the
intact Fab GX-1. A "functional fragment" is a partial
heavy or light chain variable sequence (e.g., minor
deletions at the amino or carboxy terminus of the
immunoglobulin variable region) which retains the same
antigen binding specificity and/or neutralizing ability
as the antibody from which the fragment was derived.

An "analog" is an amino acid sequence modified by
at least one amino acid, wherein said modification can
be a chemical modification, or a substitution or a
rearrangement of a few amino acids (i.e., no more than
10), which modification permits the amino acid sequence
to retain the biological characteristics, e.g., antigen
specificity and high affinity, of the unmodified
sequence. For example, (silent) mutations can be
constructed, via substitutions, when certain
endonuclease restriction sites are created within or
surrounding CDR-encoding regions.
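
The numerical part of the "analog" definition (at least one, and no more than 10, modified amino acids) can be illustrated with a minimal Python sketch. It assumes substitutions only, in two sequences of equal length; the peptide strings and the function names are hypothetical and are not taken from the figures. The sketch says nothing about whether biological characteristics are retained, which must be determined by assay.

```python
def count_substitutions(reference: str, candidate: str) -> int:
    """Count positions at which two equal-length sequences differ."""
    if len(reference) != len(candidate):
        raise ValueError("sequences must have equal length for a substitution count")
    return sum(1 for a, b in zip(reference, candidate) if a != b)


def within_analog_limit(reference: str, candidate: str, max_changes: int = 10) -> bool:
    """True if the candidate differs from the reference by 1..max_changes residues."""
    changes = count_substitutions(reference, candidate)
    return 1 <= changes <= max_changes


# Hypothetical CDR-like peptides, for illustration only.
reference = "GFTFSSYAMS"
candidate = "GFTFSNYAMS"  # single substitution at the sixth residue

print(count_substitutions(reference, candidate))
print(within_analog_limit(reference, candidate))
```

An unmodified sequence fails the check because the definition requires at least one modification.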

Analogs may also arise as allelic variations. An
"allelic variation or modification" is an alteration in
the nucleic acid sequence encoding the amino acid or
peptide sequences of the invention. Such variations or
modifications may be due to degeneracy in the genetic
code or may be deliberately engineered to provide
desired characteristics. These variations or
modifications may or may not result in alterations in
any encoded amino acid sequence.
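
Whether a given nucleotide variation alters the encoded amino acid sequence can be checked by translating both sequences with the standard genetic code. The following Python sketch is illustrative only; the nine-nucleotide sequences are hypothetical, and the helper names `translate` and `alters_protein` are assumptions introduced here.

```python
from itertools import product

# Standard genetic code, built in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate an in-frame DNA coding sequence into one-letter amino acids."""
    dna = dna.upper()
    if len(dna) % 3:
        raise ValueError("sequence length must be a multiple of three")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def alters_protein(original: str, variant: str) -> bool:
    """True if the variant encodes a different amino acid sequence."""
    return translate(original) != translate(variant)


original = "GGTTCTCTG"  # hypothetical: Gly-Ser-Leu
silent = "GGCAGCCTC"    # different codons, same Gly-Ser-Leu
changed = "GGTTCTCAG"   # last codon now encodes Gln

print(translate(original), alters_protein(original, silent), alters_protein(original, changed))
```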

The term "effector agents" refers to non-protein
carrier molecules to which the altered antibodies,
and/or natural or synthetic light or heavy chains of the
donor antibody or other fragments of the donor antibody
may be associated by conventional means. Such
non-protein carriers can include conventional carriers used
in the diagnostic field, e.g., polystyrene or other
plastic beads, polysaccharides, e.g., as used in the
BIAcore (Pharmacia) system, or other non-protein
substances useful in the medical field and safe for
administration to humans and animals. Other effector
agents may include a macrocycle, for chelating a heavy
metal atom, or radioisotopes. Such effector agents may
also be useful to increase the half-life of the altered
antibodies, e.g., polyethylene glycol.

II. Combinatorial Cloning.

As mentioned above, a number of problems have
hampered the direct application of the hybridoma
technology [G. Kohler and C. Milstein, Nature, 256:495-497
(1975)] to the generation and isolation of human
monoclonal antibodies. Among these are a lack of
suitable fusion partner myeloma cell lines used to form
hybridoma cell lines as well as the poor stability of
such hybridomas even when formed. These shortcomings
are further exacerbated in the case of RSV because of
the paucity of viral specific B cells in the peripheral
circulation. Therefore, the molecular biological
approach of combinatorial cloning is preferred.

Combinatorial cloning is disclosed generally in PCT
Publication No. WO90/14430. Simply stated, the goal of
combinatorial cloning is to transfer to a population of
bacterial cells the immunological genetic capacity of a
human cell, tissue or organ. It is preferred to employ
cells, tissues or organs which are immunocompetent.
Particularly useful sources include, without limitation,
spleen, thymus, lymph nodes, bone marrow, tonsil and
peripheral blood lymphocytes. The cells may be
optionally RSV stimulated in vitro, or selected from
donors who are known to have produced an immune
response or donors who are HIV+ but asymptomatic.

WO 00/69462 PCT/US00/13694 

The genetic information isolated from the donor
cells can be in the form of DNA or RNA and is
conveniently amplified by Polymerase Chain Reaction
(PCR) or similar techniques. When isolated as RNA, the
genetic information is preferably converted into cDNA by
reverse transcription prior to amplification. The
amplification can be generalized or more specifically
tailored. For example, by a careful selection of PCR
primer sequences, selective amplification of
immunoglobulin genes or subsets within that class of
genes can be achieved.

Once the component gene sequences are obtained, in
this case the genes encoding the variable regions of the
various heavy and light antibody chains, the light and
heavy chain genes are associated in random combinations
to form a random combinatorial library. Various
recombinant DNA vector systems have been described to
facilitate combinatorial cloning [see: PCT Publication
No. WO90/14430, supra; Scott and Smith, Science 249:386-406
(1990); or U.S. Patent 5,223,409]. Having
generated the combinatorial library, the products can,
after expression, be conveniently screened by biopanning
with RSV F protein or, if necessary, by epitope blocked
biopanning as described in more detail below.
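
The size of the pairing space in such a library follows from simple counting: H heavy chain genes combined with L light chain genes give H x L possible heavy/light pairs. The Python sketch below enumerates that space for a toy case and draws a few pairs at random. The gene labels are hypothetical placeholders, and exhaustive enumeration is used only for illustration; an actual library is formed by random association during cloning rather than by listing every pair.

```python
import random
from itertools import product


def combinatorial_library(heavy_genes, light_genes):
    """Return every possible (heavy, light) pairing of the supplied genes."""
    return list(product(heavy_genes, light_genes))


# Hypothetical placeholder labels for amplified variable region genes.
heavy = [f"VH{i}" for i in range(1, 4)]   # 3 heavy chain genes
light = [f"VL{i}" for i in range(1, 5)]   # 4 light chain genes

library = combinatorial_library(heavy, light)
print(len(library))  # 3 * 4 = 12 possible pairings

# Draw a few clones at random, as a stand-in for picking library members.
rng = random.Random(0)
print(rng.sample(library, 3))
```

Because the pairing is random, most library members will not reproduce an original in vivo heavy/light combination, which is why the subsequent screening step is needed.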

As described herein, it is preferred to use single
chain antibodies for combinatorial cloning and screening
and then to convert them to full length mAbs after
selection of the desired candidate molecules. However,
Fab fragments of mAbs can also be used for cloning and
screening.

III. Antibody Fragments.

The present invention contemplates the use of scFv,
Fab, or F(ab')2 fragments to derive full-length mAbs
directed against the F protein of RSV. Although these
fragments may be independently useful as protective and
therapeutic agents in vivo against RSV-mediated
conditions or in vitro as part of an RSV diagnostic,
they are employed herein as a component of a reshaped
human antibody. A scFv fragment contains the light and
heavy chain variable regions joined by a linker of about
12 amino acids in either a light-linker-heavy or a
heavy-linker-light orientation. A Fab fragment contains
the entire light chain and amino terminal portion of the
heavy chain; and a F(ab')2 fragment is the fragment
formed by two Fab fragments bound by additional
disulfide bonds. RSV binding monoclonal antibodies
provide sources of scFv or Fab fragments which can be
obtained from a combinatorial phage library [see, e.g.,
Winter et al., Ann. Rev. Immunol., 12:433-455 (1994) or
Barbas et al., Proc. Nat'l. Acad. Sci. (USA) 89, 10164-10168
(1992), which are both hereby incorporated by
reference in their entireties].
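
The two scFv orientations described above can be written out as simple sequence concatenation. In the Python sketch below, the variable region strings and the 12-residue linker are hypothetical placeholders chosen only to show the layout; they are not sequences of the invention, and the function name is an assumption.

```python
def assemble_scfv(vl: str, vh: str, linker: str,
                  orientation: str = "light-linker-heavy") -> str:
    """Join light and heavy variable regions through a linker peptide."""
    if orientation == "light-linker-heavy":
        return vl + linker + vh
    if orientation == "heavy-linker-light":
        return vh + linker + vl
    raise ValueError(f"unknown orientation: {orientation!r}")


# Hypothetical placeholders (not sequences from the figures).
VL = "DIQMTQ"            # stand-in for a light chain variable region
VH = "EVQLVE"            # stand-in for a heavy chain variable region
LINKER = "GGGSGGGSGGGS"  # illustrative linker of 12 amino acids

print(assemble_scfv(VL, VH, LINKER))
print(assemble_scfv(VL, VH, LINKER, orientation="heavy-linker-light"))
```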

IV. Anti-RSV Antibody Amino Acid and Nucleotide
Sequences of Interest.

The Fab GX-1 or other antibodies described herein
may contribute sequences, such as variable heavy and/or
light chain peptide sequences, framework sequences, CDR
sequences, functional fragments, and analogs thereof,
and the nucleic acid sequences encoding them, useful in
designing and obtaining various altered antibodies which
are characterized by the antigen binding specificity of
the donor antibody.

As one example, the present invention thus provides
variable light chain and variable heavy chain sequences
from the RSV human Fab GX-1A and sequences derived
therefrom. The heavy chain variable region of Fab GX-1A
is illustrated by Figs. 4, 8A-8F and 10A-10B [SEQ ID
NOS: 3-4, 13 and 15].

The nucleic acid sequences of this invention, or
fragments thereof, encoding the variable light chain and
heavy chain peptide sequences are also useful for
mutagenic introduction of specific changes within the
nucleic acid sequences encoding the CDRs or framework
regions, and for incorporation of the resulting modified
or fusion nucleic acid sequence into a plasmid for
expression. For example, silent substitutions in the
nucleotide sequence of the framework and CDR-encoding
regions can be used to create restriction enzyme sites
which would facilitate insertion of mutagenized CDR
(and/or framework) regions. These CDR-encoding regions
may be used in the construction of reshaped human
antibodies of this invention.
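
One way to look for such silent substitutions is to overlay a recognition sequence at each position of a coding sequence and keep only the positions where the translation is unchanged. The Python sketch below does this for the EcoRI site (GAATTC) in a hypothetical twelve-nucleotide coding sequence. The sequence and the function names are assumptions for illustration, and the sketch ignores practical matters such as codon usage or whether the site already occurs elsewhere in the vector.

```python
from itertools import product

# Standard genetic code, built in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate an in-frame DNA coding sequence into one-letter amino acids."""
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


def silent_site_insertions(dna: str, site: str):
    """List (offset, new_sequence) pairs where writing `site` over the
    coding sequence leaves the encoded amino acids unchanged."""
    dna, site = dna.upper(), site.upper()
    protein = translate(dna)
    hits = []
    for pos in range(len(dna) - len(site) + 1):
        if dna[pos:pos + len(site)] == site:
            continue  # site already present here; nothing to engineer
        candidate = dna[:pos] + site + dna[pos + len(site):]
        if translate(candidate) == protein:
            hits.append((pos, candidate))
    return hits


coding = "GCTGAGTTTGGT"  # hypothetical: Ala-Glu-Phe-Gly
print(silent_site_insertions(coding, "GAATTC"))
```

In this toy case the only silent placement changes the Glu and Phe codons (GAG TTT to GAA TTC) while the peptide remains Ala-Glu-Phe-Gly.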

Taking into account the degeneracy of the genetic
code, various coding sequences may be constructed which
encode the variable heavy and light chain amino acid
sequences, and CDR sequences of the invention as well as
functional fragments and analogs thereof which share the
antigen specificity of the donor antibody. The isolated
nucleic acid sequences of this invention, or fragments
thereof, encoding the variable chain peptide sequences
or CDRs can be used to produce altered antibodies, e.g.,
chimeric or humanized antibodies, or other engineered
antibodies of this invention when operatively combined
with a second immunoglobulin partner.
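
The number of distinct coding sequences available for a given peptide under the standard genetic code is the product of the number of codons for each residue. The short Python sketch below computes this; the example peptides are hypothetical and the function name is an assumption.

```python
from collections import Counter
from itertools import product
from math import prod

# Standard genetic code, built in TCAG order (third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Number of codons that encode each amino acid (or stop, written as '*').
DEGENERACY = Counter(CODON_TABLE.values())


def count_coding_sequences(peptide: str) -> int:
    """Number of distinct DNA sequences that encode the given peptide."""
    peptide = peptide.upper()
    for residue in peptide:
        if residue not in DEGENERACY:
            raise ValueError(f"unknown residue: {residue!r}")
    return prod(DEGENERACY[residue] for residue in peptide)


print(count_coding_sequences("MW"))   # Met and Trp each have a single codon
print(count_coding_sequences("LSR"))  # Leu, Ser and Arg each have six codons
```

Even a three-residue peptide can therefore have hundreds of coding sequences, and the count grows multiplicatively with length.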

It should be noted that in addition to isolated
nucleic acid sequences encoding portions of the altered
antibody and antibodies described herein, other such
nucleic acid sequences are encompassed by the present
invention, such as those complementary to the native
CDR-encoding sequences or complementary to the human
framework regions surrounding the CDR-encoding regions.
Such sequences include all nucleic acid sequences which
by virtue of the redundancy of the genetic code are
capable of encoding the same amino acid sequence as
given in Figs. 3 and 4 [SEQ ID NOS: 2 and 4]. Figs. 6
and 7 [SEQ ID NOS: 5-12] provide representations of such
sequences. Other useful DNA sequences encompassed by
this invention include those sequences which hybridize
under stringent hybridization conditions [See: T.
Maniatis et al., Molecular Cloning (A Laboratory
Manual), Cold Spring Harbor Laboratory (1982), pages 387
to 389] to the DNA sequences encoding the GX-1
antibodies (e.g., sequences of Figs. 3, 4, 8A-8F through
11 [SEQ ID NOS: 1-4, 13-16]) and which retain the
antigen binding properties of those antibodies. An
example of one such stringent hybridization condition is
hybridization at 4XSSC at 65°C, followed by a washing in
0.1XSSC at 65°C for an hour. Alternatively, an exemplary
stringent hybridization condition is in 50% formamide,
4XSSC at 42°C. Preferably, these hybridizing DNA
sequences are at least about 18 nucleotides in length,
i.e., about the size of a CDR.
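
For orientation, the SSC strengths named above can be converted to salt molarities, taking 1X SSC as 0.15 M sodium chloride and 0.015 M sodium citrate. The Python sketch below performs that conversion and applies the "at least about 18 nucleotides" length preference as a simple filter. The function names and the example sequence are assumptions introduced for illustration.

```python
# 1X SSC is 0.15 M sodium chloride and 0.015 M sodium citrate.
SSC_1X_MOLAR = {"sodium chloride": 0.15, "sodium citrate": 0.015}


def ssc_molarity(fold: float) -> dict:
    """Molar salt concentrations of an SSC buffer at the given strength."""
    return {salt: molar * fold for salt, molar in SSC_1X_MOLAR.items()}


def long_enough(sequence: str, minimum: int = 18) -> bool:
    """True if the sequence meets the preferred minimum length."""
    return len(sequence) >= minimum


# Hybridization (4X) and wash (0.1X) strengths from the example conditions.
for fold in (4, 0.1):
    print(fold, {salt: round(m, 4) for salt, m in ssc_molarity(fold).items()})

probe = "ACGT" * 5  # hypothetical 20-nucleotide sequence
print(long_enough(probe))
```

The wash at 0.1X SSC has a forty-fold lower salt concentration than the 4X hybridization step, which is what makes the wash the more stringent of the two.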

V. Altered Immunoglobulin Coding Regions and
Altered Antibodies.

Altered immunoglobulin coding regions encode
altered antibodies which include engineered antibodies
such as chimeric antibodies, humanized, reshaped, and
immunologically edited human antibodies. A desired
altered immunoglobulin coding region contains
CDR-encoding regions in the form of scFv regions that encode
peptides having the antigen specificity of an RSV
antibody, preferably a high affinity antibody such as
provided by the present invention, inserted into an
acceptor immunoglobulin partner.

When the acceptor is an immunoglobulin partner, as
defined above, it includes a sequence encoding a second
antibody region of interest, for example, an Fc region.
Immunoglobulin partners may also include sequences
encoding another immunoglobulin to which the light or
heavy chain constant region is fused in frame or by
means of a linker sequence. Engineered antibodies
directed against functional fragments or analogs of RSV
may be designed to elicit enhanced binding with the same
antibody.

The immunoglobulin partner may also be associated
with effector agents as defined above, including
non-protein carrier molecules, to which the immunoglobulin
partner may be operatively linked by conventional means.

Fusion or linkage between the immunoglobulin
partners, e.g., antibody sequences, and the effector
agent may be by any suitable means, e.g., by
conventional covalent or ionic bonds, protein fusions,
or hetero-bifunctional cross-linkers, e.g.,
carbodiimide, glutaraldehyde, and the like. Such
techniques are known in the art and readily described in
conventional chemistry and biochemistry texts.

Additionally, conventional linker sequences which
simply provide for a desired amount of space between the
second immunoglobulin partner and the effector agent may
also be constructed into the altered immunoglobulin
coding region. The design of such linkers is well known
to those of skill in the art.

In addition, signal sequences for the molecules of
the invention may be modified to enhance expression.
For example, the reshaped human antibody having the
signal sequence and CDRs derived from the Fab GX-1 heavy
chain sequence may have the original signal peptide
replaced with another signal sequence such as the
Campath leader sequence [Page, M. J. et al.,
BioTechnology 9:64-68 (1991)].

An exemplary altered antibody, a reshaped human
antibody, contains a variable heavy and the entire light
chain peptide or protein sequence having the antigen
specificity of Fab GX-1, fused to the constant heavy
regions CH1-CH3 derived from a second human antibody.

In still a further embodiment, the engineered
antibody of the invention may have attached to it an
additional agent. For example, the procedure of
recombinant DNA technology may be used to produce an
engineered antibody of the invention in which the Fc
fragment or CH2-CH3 domain of a complete antibody
molecule has been replaced by an enzyme or other
detectable molecule (i.e., a polypeptide effector or
reporter molecule).

Another desirable protein of this invention may
comprise a complete antibody molecule, having full
length heavy and light chains, or any discrete fragment
thereof, such as the Fab or F(ab')2 fragments, a heavy
chain dimer, or any minimal recombinant fragments
thereof such as an Fv or a single-chain antibody (SCA) or
any other molecule with the same specificity as the
selected donor Fab GX-1. Such protein may be used in
the form of an altered antibody, or may be used in its
unfused form.

Whenever the immunoglobulin partner is derived from
an antibody different from the donor antibody, e.g., any
isotype or class of immunoglobulin framework or constant
regions, an engineered antibody results. Engineered
antibodies can comprise immunoglobulin (Ig) constant
regions and variable framework regions from one source,
e.g., the acceptor antibody, and one or more (preferably
all) CDRs from the donor antibody, e.g., the anti-RSV
antibody described herein. In addition, alterations,
e.g., deletions, substitutions, or additions, of the
acceptor mAb light and/or heavy variable domain
framework region at the nucleic acid or amino acid
levels, or the donor CDR regions may be made in order to
retain donor antibody antigen binding specificity or to
reduce potential immunogenicity.

Such engineered antibodies are designed to employ
one (or both) of the variable heavy and/or light chains
of the RSV mAb (optionally modified as described) or one
or more of the below-identified heavy or light chain
CDRs. The engineered antibodies of the invention are
neutralizing, i.e., they desirably inhibit virus growth
in vitro and in vivo in animal models of RSV infection.

Such engineered antibodies may include a reshaped
human antibody containing the human heavy and light
chain constant regions fused to the RSV antibody
functional fragments. A suitable human (or other
animal) acceptor antibody may be one selected from a
conventional database, e.g., the KABAT® database, Los
Alamos database, and Swiss Protein database, by homology
to the nucleotide and amino acid sequences of the donor
antibody. A human antibody characterized by a homology
to the framework regions of the donor antibody (on an
amino acid basis) may be suitable to provide a heavy
chain constant region and/or a heavy chain variable
framework region for insertion of the donor CDRs. A
suitable acceptor antibody capable of donating light
chain constant or variable framework regions may be
selected in a similar manner. It should be noted that
the acceptor antibody heavy and light chains are not
required to originate from the same acceptor antibody.
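
Selection of an acceptor by homology can be pictured as ranking candidate framework sequences by percent amino acid identity to the donor framework. The Python sketch below assumes the sequences are already aligned and of equal length; the peptide strings, the candidate names and the function names are hypothetical, and a real search would use a sequence database and an alignment tool rather than this position-by-position comparison.

```python
def percent_identity(a: str, b: str) -> float:
    """Percent of identical positions in two pre-aligned, equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be pre-aligned to equal length")
    matches = sum(x == y for x, y in zip(a, b))
    return 100.0 * matches / len(a)


def rank_acceptors(donor_framework: str, candidates: dict) -> list:
    """Return (name, identity) pairs sorted from most to least similar."""
    scored = [(name, percent_identity(donor_framework, seq))
              for name, seq in candidates.items()]
    return sorted(scored, key=lambda item: item[1], reverse=True)


# Hypothetical framework fragments, for illustration only.
donor = "EVQLVESGGG"
candidates = {
    "acceptor_A": "EVQLLESGGG",  # one mismatch
    "acceptor_B": "QVQLQESGPG",  # three mismatches
}

for name, identity in rank_acceptors(donor, candidates):
    print(name, identity)
```

Heavy and light chain frameworks can be ranked separately, consistent with the note that the two acceptor chains need not come from the same antibody.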

Desirably the heterologous framework and constant
regions are selected from human immunoglobulin classes
and isotypes, such as IgG (subtypes 1 through 4), IgM,
IgA and IgE. The Fc domains are not limited to native
sequences, but include mutant variants known in the art
that alter function. For example, mutations have been
described in the Fc domains of certain IgG antibodies
that reduce Fc-mediated complement and Fc receptor
binding [see, e.g., A. R. Duncan et al., Nature,
332:563-564 (1988); A. R. Duncan and G. Winter, Nature,
332:738-740 (1988); M.-L. Alegre et al., J. Immunol.,
148:3461-3468 (1992); M.-H. Tao et al., J. Exp. Med.,
178:661-667 (1993); and V. Xu et al., J. Biol. Chem.,
269:3469-3474 (1994)]; alter clearance rate [J.-K. Kim
et al., Eur. J. Immunol., 24:542-548 (1994)]; and reduce
structural heterogeneity [S. Angal et al., Mol. Immunol.,
30:105-108 (1993)]. Also, other modifications are
possible such as oligomerization of the antibody by
addition of the tailpiece segment of IgM and other
mutations [R. I. F. Smith and S. L. Morrison,
Biotechnology 12:683-688 (1994); R. I. F. Smith et al.,
J. Immunol., 154:2226-2236 (1995)] or addition of the
tailpiece segment of IgA [I. Kariv et al., J. Immunol.,
157:29-38 (1996)]. However, the acceptor antibody need
not comprise only human immunoglobulin protein
sequences. For instance, a gene may be constructed in
which a DNA sequence encoding part of a human
immunoglobulin chain is fused to a DNA sequence encoding
a non-immunoglobulin amino acid sequence such as a
polypeptide effector or reporter molecule.

The altered antibody thus preferably has the
structure of a natural human antibody or a fragment
thereof, and possesses the combination of properties
required for effective therapeutic use, e.g., treatment
of RSV mediated diseases in man, or for diagnostic uses.

It will be understood by those skilled in the art
that an altered antibody may be further modified by
changes in variable domain amino acids without
necessarily affecting the specificity and high affinity
of the donor antibody (i.e., an analog). It is
anticipated that heavy and light chain amino acids may
be substituted by other amino acids either in the
variable domain frameworks or CDRs or both.
Particularly preferred is the immunological editing of
such reconstructed sequences as illustrated in the
examples herein.

In addition, the variable or constant region may be
altered to enhance or decrease selective properties of
the molecules of the instant invention, as described
above. Examples of such properties include dimerization,
binding to Fc receptors, or the ability to bind and
activate complement [see, e.g., Angal et al., Mol. Immunol.,
30:105-108 (1993); Xu et al., J. Biol. Chem., 269:3469-3474
(1994); and Winter et al., EP 307,434-B].

Such antibodies are useful in the prevention and
treatment of RSV mediated disorders, as discussed below.

VI. Production of Altered Antibodies and
Engineered Antibodies.

The resulting reshaped human antibodies of this
invention can be expressed in recombinant host cells,
e.g., COS, CHO or myeloma cells. A conventional
expression vector or recombinant plasmid is produced by
placing these coding sequences for the altered antibody
in operative association with conventional regulatory
control sequences capable of controlling the replication
and expression in, and/or secretion from, a host cell.
Regulatory sequences include promoter sequences, e.g.,
CMV promoter, and signal sequences, which can be derived
from other known antibodies. Similarly, a second
expression vector can be produced having a DNA sequence
which encodes a complementary antibody light or heavy
chain. Preferably this second expression vector is
identical to the first except insofar as the coding
sequences and selectable markers are concerned. This
ensures as far as possible that each polypeptide chain
is functionally expressed. Alternatively, the heavy and
light chain coding sequences for the altered antibody
may reside on a single vector.

A selected host cell is co-transfected by
conventional techniques with both the first and second
vectors (or simply transfected by a single vector) to
create the transfected host cell of the invention
comprising both the recombinant or synthetic light and
heavy chains. The transfected cell is then cultured by
conventional techniques to produce the engineered
antibody of the invention. The production of the
antibody which includes the association of both the
recombinant heavy chain and light chain is measured in
the culture by an appropriate assay, such as an
enzyme-linked immunosorbent assay (ELISA) or radioimmunoassay
(RIA). Similar conventional techniques may be employed
to construct other altered antibodies and molecules of
this invention.

Suitable vectors for the cloning and subcloning
steps employed in the methods and construction of the
compositions of this invention may be selected by one of
skill in the art. For example, the conventional pUC
series of cloning vectors may be used. One vector used
is pUC19, which is commercially available from supply
houses, such as Amersham (Buckinghamshire, United
Kingdom) or Pharmacia (Uppsala, Sweden). Any vector
which is capable of replicating readily, has an
abundance of cloning sites and selectable genes (e.g.,
antibiotic resistance), and is easily manipulated may
be used for cloning. Thus, the selection of the cloning
vector is not a limiting factor in this invention.

Similarly, the vectors employed for expression of
the engineered antibodies according to this invention
may be selected by one of skill in the art from any
conventional vectors. Preferred vectors include, for
example, plasmids pCD or pCN. The vectors also contain
selected regulatory sequences (such as CMV promoters)
which direct the replication and expression of
heterologous DNA sequences in selected host cells.
These vectors contain the above described DNA sequences
which code for the engineered antibody or altered
immunoglobulin coding region. In addition, the vectors
may incorporate the selected immunoglobulin sequences
modified by the insertion of desirable restriction sites
for ready manipulation.

The expression vectors may also be characterized by
genes suitable for amplifying expression of the
heterologous DNA sequences, e.g., the mammalian
dihydrofolate reductase gene (DHFR). Other preferable
vector sequences include a polyadenylation (polyA)
signal sequence, such as from bovine growth hormone
(BGH) and the betaglobin promoter sequence (betaglopro).
The expression vectors useful herein may be synthesized
by techniques well known to those skilled in this art.
The components of such vectors, e.g., replicons,
selection genes, enhancers, promoters, signal sequences
and the like, may be obtained from commercial or natural
sources or synthesized by known procedures for use in
directing the expression and/or secretion of the product
of the recombinant DNA in a selected host. Other
appropriate expression vectors of which numerous types
are known in the art for mammalian, bacterial, insect,
yeast, and fungal expression may also be selected for
this purpose.

The present invention also encompasses a cell line
transfected with a recombinant plasmid containing the
coding sequences of the engineered antibodies or altered
immunoglobulin molecules thereof. Host cells useful for
the cloning and other manipulations of these cloning
vectors are also conventional. However, most desirably,
cells from various strains of E. coli are used for
replication of the cloning vectors and other steps in
the construction of altered antibodies of this
invention.

Suitable host cells or cell lines for the
expression of the engineered antibody or altered
antibody of the invention are preferably mammalian cells
such as CHO, COS, a fibroblast cell (e.g., 3T3), and
myeloid cells, and more preferably a CHO or a myeloid
cell. Human cells may be used, thus enabling the
molecule to be modified with human glycosylation
patterns. Alternatively, other eukaryotic cell lines
may be employed. The selection of suitable mammalian
host cells and methods for transformation, culture,
amplification, screening and product production and
purification are known in the art. See, e.g., Sambrook
et al., Molecular Cloning (A Laboratory Manual), 2nd
edit., Cold Spring Harbor Laboratory (1989).

Bacterial cells may prove useful as host cells
suitable for the expression of the recombinant scFvs,
Fabs and MAbs of the present invention [see, e.g.,
Pluckthun, A., Immunol. Rev., 130:151-188 (1992)]. The
tendency of proteins expressed in bacterial cells to be
in an unfolded or improperly folded form or in a
non-glycosylated form does not pose as great a concern
because Fabs are not normally glycosylated and can be
engineered for exported expression, thereby reducing the
high concentration that facilitates misfolding.
Nevertheless, any recombinant Fab produced in a
bacterial cell would be screened for retention of
antigen binding ability. If the molecule expressed by
the bacterial cell was produced and exported in a
properly folded form, that bacterial cell would be a
desirable host. For example, various strains of E. coli
used for expression are well-known as host cells in the
field of biotechnology. Various strains of B. subtilis,
Streptomyces, other bacilli and the like may also be
employed in this method.

Where desired, strains of yeast cells known to
those skilled in the art are also available as host
cells, as well as insect cells, e.g., Drosophila and
Lepidoptera, and viral expression systems [see, e.g.,
Miller et al., Genetic Engineering, 8:277-298, Plenum
Press (1986) and references cited therein].

The general methods by which the vectors of the
invention may be constructed, the transfection methods
required to produce the host cells of the invention, and
culture methods necessary to produce the altered
antibody of the invention from such host cell are all
conventional techniques. Likewise, once produced, the
altered antibodies of the invention may be purified from
the cell culture contents according to standard
procedures of the art, including ammonium sulfate
precipitation, affinity columns, column chromatography,
gel electrophoresis and the like. Such techniques are
within the skill of the art and do not limit this
invention.

Yet another method of expression of reshaped
antibodies may utilize expression in a transgenic
animal. An exemplary system is described in U.S.
Patent No. 4,873,316. The expression system described
in that reference uses the animal's casein promoter and,
when transgenically incorporated into a mammal, permits
the female to produce the desired recombinant protein in
its milk.

Once expressed by the desired method, the
engineered antibody is then examined for in vitro
activity by use of an appropriate assay. At present,
conventional ELISA assay formats are employed to assess
qualitative and quantitative binding of the altered
antibody to RSV. Additionally, other in vitro assays
and in vivo animal models may also be used to verify
neutralizing efficacy prior to subsequent human clinical
studies performed to evaluate the persistence of the
altered antibody in the body despite the usual clearance
mechanisms.

VII. Therapeutic/Prophylactic Uses.

This invention also relates to a method of treating
humans experiencing RSV-related symptoms which comprises
administering an effective dose of antibodies including
one or more of the antibodies (altered, reshaped,
monoclonal, etc.) described herein or fragments thereof.

The therapeutic response induced by the use of the
molecules of this invention is produced by binding to
RSV and thus subsequently blocking RSV propagation.
Thus, the molecules of the present invention, when in
preparations and formulations appropriate for
therapeutic use, are highly desirable for those persons
experiencing RSV infection. For example, longer
treatments may be desirable when treating seasonal
episodes or the like. The dose and duration of
treatment relates to the relative duration of the
molecules of the present invention in the human
circulation, and can be adjusted by one of skill in the
art depending upon the condition being treated and the
general health of the patient.

The altered antibodies, antibodies and fragments
thereof of this invention may also be used alone or in
conjunction with other antibodies, particularly human or
humanized mAbs reactive with other epitopes on the F
protein or other RSV target antigens as prophylactic
agents.

The mode of administration of the therapeutic and
prophylactic agents of the invention may be any suitable
route which delivers the agent to the host. The altered
antibodies, antibodies, engineered antibodies, and
fragments thereof, and pharmaceutical compositions of
the invention are particularly useful for parenteral
administration, i.e., subcutaneously, intramuscularly,
intravenously, or intranasally.

Therapeutic and prophylactic agents of the
invention may be prepared as pharmaceutical compositions
containing an effective amount of the altered antibody
of the invention as an active ingredient in a
pharmaceutically acceptable carrier. An aqueous
suspension or solution containing the antibody,
preferably buffered at physiological pH, in a form ready
for injection is preferred. The compositions for
parenteral administration will commonly comprise a
solution of the engineered antibody of the invention or
a cocktail thereof dissolved in a pharmaceutically
acceptable carrier, preferably an aqueous carrier. A
variety of aqueous carriers may be employed, e.g., 0.4%
saline, 0.3% glycine, and the like. These solutions are
sterile and generally free of particulate matter. These
solutions may be sterilized by conventional, well known
sterilization techniques (e.g., filtration). The
compositions may contain pharmaceutically acceptable
auxiliary substances as required to approximate
physiological conditions such as pH adjusting and
buffering agents, etc. The concentration of the
antibody of the invention in such pharmaceutical
formulation can vary widely, i.e., from less than about
0.5%, usually at or at least about 1% to as much as 15
or 20% by weight and will be selected primarily based on
fluid volumes, viscosities, etc., according to the
particular mode of administration selected.

Thus, a pharmaceutical composition of the invention
for intramuscular injection could be prepared to contain
1 mL sterile buffered water, and between about 1 ng and
about 100 mg, e.g., about 50 ng to about 80 mg, or more
preferably, about 5 mg to about 75 mg, of an engineered
antibody of the invention. Similarly, a pharmaceutical
composition of the invention for intravenous infusion
could be made up to contain about 250 ml of sterile
Ringer's solution, and about 1 to about 75 and
preferably 5 to about 50 mg/ml of an engineered antibody
of the invention. Actual methods for preparing
parenterally administrable compositions are well known
or will be apparent to those skilled in the art and are
described in more detail in, for example, Remington's
Pharmaceutical Science, 15th ed., Mack Publishing
Company, Easton, Pennsylvania.
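
As a rough arithmetic check on the intravenous example above, the total antibody mass in an infusion is volume multiplied by concentration. The following Python sketch uses the 250 ml volume and the 1 to 75 mg/ml (preferably 5 to 50 mg/ml) range quoted in the text. The function and constant names are illustrative assumptions and are not part of the disclosure.

```python
from typing import Tuple

# Values quoted in the text for the intravenous infusion example.
INFUSION_VOLUME_ML = 250.0
BROAD_RANGE_MG_PER_ML = (1.0, 75.0)
PREFERRED_RANGE_MG_PER_ML = (5.0, 50.0)


def total_antibody_mg(volume_ml: float, conc_mg_per_ml: float) -> float:
    """Total antibody mass (mg) = volume (ml) x concentration (mg/ml)."""
    if volume_ml < 0 or conc_mg_per_ml < 0:
        raise ValueError("volume and concentration must be non-negative")
    return volume_ml * conc_mg_per_ml


def mass_range_mg(conc_range: Tuple[float, float]) -> Tuple[float, float]:
    """Mass range (mg) for the 250 ml infusion over a concentration range."""
    low, high = conc_range
    return (total_antibody_mg(INFUSION_VOLUME_ML, low),
            total_antibody_mg(INFUSION_VOLUME_ML, high))


if __name__ == "__main__":
    print("broad range (mg):", mass_range_mg(BROAD_RANGE_MG_PER_ML))
    print("preferred range (mg):", mass_range_mg(PREFERRED_RANGE_MG_PER_ML))
```

Under these figures the preferred concentration range corresponds to roughly 1.25 g to 12.5 g of antibody per 250 ml.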

It is preferred that the therapeutic and
prophylactic agents of the invention, when in a
pharmaceutical preparation, be present in unit dose
forms. The appropriate therapeutically effective dose
can be determined readily by those of skill in the art.
To effectively treat an inflammatory disorder in a human
or other animal, one dose of approximately 0.1 mg to
approximately 20 mg per 70 kg body weight of a protein
or an antibody of this invention should be administered
parenterally, preferably i.v. or i.m. (intramuscularly).
Such dose may, if necessary, be repeated at appropriate
time intervals selected as appropriate by a physician.
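
The dose range above is stated per 70 kg of body weight. A minimal sketch of how such a range could be rescaled to another body weight is given below; it assumes simple linear proportionality, which the text does not state, and the function name is hypothetical.

```python
from typing import Tuple

REFERENCE_WEIGHT_KG = 70.0             # body weight the quoted range refers to
DOSE_RANGE_MG_PER_70KG = (0.1, 20.0)   # approximate range quoted in the text


def scaled_dose_range_mg(body_weight_kg: float) -> Tuple[float, float]:
    """Rescale the per-70-kg range to another weight (linear scaling is an assumption)."""
    if body_weight_kg <= 0:
        raise ValueError("body weight must be positive")
    factor = body_weight_kg / REFERENCE_WEIGHT_KG
    low, high = DOSE_RANGE_MG_PER_70KG
    return (low * factor, high * factor)


if __name__ == "__main__":
    for weight in (70.0, 35.0):
        print(weight, "kg ->", scaled_dose_range_mg(weight), "mg")
```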

The altered antibodies and engineered antibodies of
this invention may also be used in diagnostic regimens,
such as for the determination of RSV-mediated disorders
or tracking progress of treatment of such disorders. As
diagnostic reagents, these altered antibodies may be
conventionally labeled for use in ELISAs and other
conventional assay formats for the measurement of RSV
levels in serum, plasma or other appropriate tissue, or
the release by human cells in culture. The nature of
the assay in which the altered antibodies are used is
conventional and does not limit this disclosure.

The antibodies, altered antibodies or fragments
thereof described herein can be lyophilized for storage
and reconstituted in a suitable carrier prior to use.
This technique has been shown to be effective with
conventional immunoglobulins and art-known
lyophilization and reconstitution techniques can be
employed.

The following examples illustrate various aspects
of this invention including the construction of
exemplary engineered antibodies and expression thereof
in suitable vectors and host cells, and are not to be
construed as limiting the scope of this invention. All
amino acids are identified by conventional three letter
or single letter codes. All necessary restriction
enzymes, plasmids, and other reagents and materials were
obtained from commercial sources unless otherwise
indicated. All general cloning, ligation and other
recombinant DNA methodology was performed as in T.
Maniatis et al., cited above, or Sambrook et al., cited
above.
Example 1: Isolation of Gλ-1 scFv-1

Single chain (sc) Fv libraries were prepared from
an individual purposely exposed to RSV and selected
against recombinant RSV F-protein following described
procedures [R. H. Jackson et al., in Protein Engineering,
A Practical Approach, A. R. Rees et al. eds., Oxford
University Press, chapter 12, pp. 277-301, 1992; H. R.
Hoogenboom et al., Nucl. Acid Res., 19: 4133-4137
(1991); J. D. Marks et al., J. Mol. Biol., 222: 581-597
(1991)]. Briefly, lymphocytes were isolated from a
blood sample taken 15 days post exposure. RNA isolated
from the lymphocytes was used for preparation of scFv
encoding repertoires for phage display. Sets of
V-region primers were paired with constant region primers
for heavy chain domain 1 IgG and IgM and light chain C-κ
and C-λ and then linked in a scFv VH-VL orientation with
a 15 amino acid spacer (glycine4-serine)3 [SEQ ID NO: 21]
by overlap PCR [see J. D. Marks et al., cited above, for
description of the primers].
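
The VH-linker-VL arrangement described above can be sketched in a few lines of Python. The VH and VL strings are the protein sequences printed in Figures 4 and 3; the linker is three repeats of Gly-Gly-Gly-Gly-Ser, i.e. 15 residues. The helper name `assemble_scfv` is an assumption made for illustration, and the actual phage-displayed construct may carry additional junction or tag residues that are not shown here.

```python
# Heavy chain variable region as printed in Figure 4 (119 residues).
VH = (
    "EVQLVESGGGLVQPGGS" "LRLSCAASGVSLSGYKM" "NWVRQAPGKGLEWVSS"
    "ITGMSNYIHYSDSVKGR" "FTISRDNAMNSLYLQMN" "SLTAEDTGVYYCATQP"
    "GELAPFDHWGQGTLVTV" "SS"
)

# Light chain variable region as printed in Figure 3 (112 residues).
VL = (
    "QSVLTQPPSVSAAPGQK" "VTISCTGSSSNLGAGYD" "VHWYRQLPGTAPKLLI"
    "YDNNNRPSGVPDRFSGS" "KSGPSASLAISGLQAED" "EADYYCQSYDSSLNGY"
    "VFGTGTQLTVLG"
)

# (Gly4-Ser)3 spacer: three repeats of GGGGS, 15 amino acids in total.
LINKER = "GGGGS" * 3


def assemble_scfv(vh: str, vl: str, linker: str = LINKER) -> str:
    """Join VH and VL in the VH-VL orientation with the spacer between them."""
    return vh + linker + vl


if __name__ == "__main__":
    scfv = assemble_scfv(VH, VL)
    print(len(LINKER), len(VH), len(VL), len(scfv))
```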

The resulting four scFv repertoires (V-κ with IgG
and IgM, V-λ with IgG and IgM) were cloned into a
phagemid vector similar to pHEN1 [H. R. Hoogenboom et
al., cited above] resulting in fusion of the scFvs to
gene III of phage fd. The vector was then transformed
into E. coli (e.g., strain TG1) by electroporation to
yield the corresponding phagemid libraries.

Phage libraries displaying the scFv-gene 3 fusions
were prepared by infection of each of the plasmid
libraries with the M13K07 helper phage [R. H. Jackson,
cited above] and were individually subjected to 2 rounds
of panning against recombinant F-protein coated onto
plastic. In the first round, 10^11 phage in 2.5 ml
phosphate buffered saline (PBS)/2% Marval™ non-fat dry
milk were incubated for 90 minutes in a tube coated with
5 µg/ml of F-protein [described in P. Tsui et al., J.
Immunol., 157: 772-780 (1996)] followed by 1 wash with
10x PBS/0.05% Tween 20 and a second wash with 10x PBS
alone. Bound phage were eluted with 10 mM triethylamine
and the eluate was neutralized with 1 M Tris-HCl, pH
7.4. The eluted phage were amplified and subjected to a
similar second round of panning, except that the
concentration of F-protein for coating was 2 µg/ml and
the wash buffer contained 20x PBS.

E. coli were infected with the eluted phage and 96
colonies from each starting library were superinfected
with helper phage and screened for F-protein binding
activity. Only four positive clones were obtained from
the 2 IgM libraries, whereas 41 positives were observed
for the IgG libraries. By partial sequence analysis,
all of the clones carried one of three different heavy
chains. Complete sequences were obtained for the heavy
and light chain V-regions for six clones, all from the
IgG libraries.

Serial dilutions of titered phage stocks of each of
these six clones were tested by ELISA for binding to
recombinant F-protein and to RSV infected cell lysate.
All showed binding to F-protein with the phage
designated Gλ-1 showing the best activity. However,
Gκ-1 and three other clones showed little binding to the
RSV lysate.

Three clones: Gλ-1, Gλ-3 (lysate binding
positive), and Gκ-1 (lysate binding negative), where "κ"
and "λ" designate the class of the light chain, were
characterized further for competition of their binding
by F-protein specific neutralizing monoclonal
antibodies, and their ability to inhibit virus
infection. The neutralizing mAbs RSV19 and B4 described
in International patent publication No. WO92/04381,
published March 19, 1992, and International patent
publication No. WO93/20210, published October 14, 1993,
recognize distinct epitopes on the F-protein. Gκ-1 was
strongly inhibited by both antibodies. Gλ-1 was
significantly inhibited by B4 only. Gλ-3 was not
inhibited by either antibody (shown for Gλ-1 only; see
Figs. 1A and 1B). In initial assays (Table I,
experiments 1-3), all three clones showed neutralizing
activity in vitro, with Gλ-1 being the most potent (Fig.
2, a graph of experiment 2), while control wild-type
phage (M13K07) not displaying scFv had no effect.

To address the possibility that neutralization
might result just from phage coating of virus,
irrespective of epitope, a phage preparation of the
non-neutralizing Fab 5-16 was tested in the same assay. In
three out of four assays, this preparation also showed
good neutralization activity, as did the control phage
in two of these assays (Table I, experiments 4-7). This
confounding observation of variable neutralization by
both Fab 5-16 and control M13K07 phage rendered the
viral neutralization studies inconclusive.
Legend:

1 Assay according to M. J. Cannon, J. Virol. Meth.,
16:293-301. Virus at 100 infectious centers/well
was incubated with dilutions of the indicated phage
for 1 hr and then added to susceptible cells for 3
hr. The virus/phage solution was aspirated and
replaced with fresh medium and the cells were
incubated overnight before peroxidase staining for
virus infected cells.

aru = ampicillin resistance units, a measure of
phagemid containing particles.

kru = kanamycin resistance units, a measure of
particles containing the phage genome (for the
M13K07 control only).
In the face of these results, made more ambiguous
by the dependence of all assays on phage stocks versus
antibody proteins of known concentration, Gλ-1 was
selected as the most likely candidate for a potent
neutralizing antibody based on (1) its apparent better
binding to F-protein, (2) its selective inhibition of
binding by the B4 antibody, and (3) its suggested
activity over background in the virus neutralization
assay.


Example 2: Conversion of Gλ-1 scFv to mAb Version A

The DNA and encoded protein sequences of the VL and
VH regions of Gλ-1 are shown in Figs. 3 [SEQ ID NOS: 1
and 2] and 4 [SEQ ID NOS: 3 and 4], respectively. For
expression in mammalian cells, the heavy chain variable
region and the light chain variable region from the Gλ-1
plasmid were cloned into derivatives of plasmid pCDN
[Nambi, A. et al., Mol. Cell. Biochem., 131: 75-86
(1994)] in which the expression of the antibody chain is
driven by the cytomegalovirus (CMV) promoter.
Plasmid pCD-HC68B is used for expressing full length
heavy chains and plasmid pCN-HuLC for expressing full
length light chains.

In the initial constructs, changes in the sequence
at the amino terminus were introduced by the PCR primers
used for cloning the light chain and heavy chain
variable regions from plasmid Gλ-1. In these
constructs, the peptide signal sequence for both the
heavy and light chains is derived from the Campath light
chain [M. J. Page et al., Biotechnology 9: 64-68
(1991)]. The heavy chain of Gλ-1 was PCR amplified from
Gλ-1 phagemid DNA, using primers for the amino terminus
and framework 4 of the variable region. The resulting
PCR fragment was cut with XhoI (site introduced by the
amino terminus primer) and BstEII (naturally occurring
site in framework 4), and cloned into an intermediate
vector, F4HCV, at the XhoI/BstEII sites.
This cloning grafted the variable region of Gλ-1
onto the constant region of another anti-RSV heavy chain
194-F4 [cloned at SmithKline Beecham from a human
hybridoma]. This intermediate clone was cut with XhoI
and Bsp120I, and introduced into the same sites in
pCD-HC68B. The XhoI site is introduced at the amino
terminus by the PCR primer and, when cloned into
pCD-HC68B at the same site, is preceded in frame by the
Campath leader sequence. The Bsp120I site is a
naturally occurring, highly conserved sequence at the
beginning of the CH1 domain, and when cloned into
pCD-HC68B at the same site, is in frame with the remaining
sequence for the CH1 through CH3 regions of human IgG1.
In the resulting construct, Gλ-1Apcd (Figs. 8A-8F [SEQ
ID NO: 13]), the amino acids immediately following the
Campath leader are EVQLLE [SEQ ID NO: 17], where the
residues LE are encoded by the nucleotide sequence for
the XhoI cloning site.

The light chain of Gλ-1 was PCR amplified from the
Gλ-1 phagemid DNA, using primers for the amino terminus
and framework 4 of the variable region. The resulting
PCR fragment was cut with SacI (site introduced by the
amino terminus primer) and AvrII (naturally occurring
site in framework 4), and cloned into 43-1pcn at the
SacI/AvrII sites. This cloning grafted the variable
region of Gλ-1, in frame, onto the constant region of
another anti-RSV lambda light chain 43 [P. Tsui et al.,
J. Immunol., 157: 772-780 (1996)], which had been cloned
at SmithKline Beecham from a combinatorial library
derived from RNA isolated from human spleen. The SacI
site is introduced at the amino terminus by the PCR
primer and, when cloned into 43pcn at the same site, is
preceded in frame by the Campath leader sequence. The
first two amino acids of the mature light chain are
therefore deleted. In the resulting construct, Gλ-1Apcn
(Figs. 9A-9E [SEQ ID NO: 14]), the first two amino acids
immediately following the leader are EL, where the
residues EL are encoded by the nucleotide sequence for
the SacI cloning site.
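
The statements that the XhoI site encodes LE and the SacI site encodes EL can be checked by translating the recognition sequences (CTCGAG for XhoI, GAGCTC for SacI) with the standard genetic code. The Python sketch below does this and also translates the 18 nucleotides that follow the leader in Figure 8A. The `translate` helper is an illustrative assumption, not a tool used in the original work.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"   # first base T
    "LLLLPPPPHHQQRRRR"   # first base C
    "IIIMTTTTNNKKSSRR"   # first base A
    "VVVVAAAADDEEGGGG"   # first base G
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

XHOI_SITE = "CTCGAG"   # XhoI recognition sequence
SACI_SITE = "GAGCTC"   # SacI recognition sequence

# Stretch immediately after the leader in Figure 8A (spaces removed).
HEAVY_N_TERMINUS_DNA = "gaggtccaactgctcgag"


def translate(dna: str) -> str:
    """Translate an in-frame DNA string into one-letter amino acid codes."""
    seq = dna.upper().replace(" ", "")
    if len(seq) % 3 != 0:
        raise ValueError("sequence length must be a multiple of three")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


if __name__ == "__main__":
    print("XhoI site:", translate(XHOI_SITE))
    print("SacI site:", translate(SACI_SITE))
    print("Heavy chain N-terminus:", translate(HEAVY_N_TERMINUS_DNA))
```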
The nucleotide sequences of the plasmids Gλ-1Apcd
and Gλ-1Apcn are shown in Figs. 8A-8F [SEQ ID NO: 13]
and 9A-9E [SEQ ID NO: 14] respectively. This set of
vectors was used to produce antibody Gλ-1A in COS cells
and in CHO cells.


Example 3: Cloning of the Corrected Gλ-1 Heavy and Light
Chains

In cloning the variable region of the Gλ-1 heavy
chain from the single chain Fv (scFv) format into the
full length format, the fifth amino acid at the amino
terminus was changed from Val to Leu, for cloning
purposes. To correct this change, PCR primers were
designed for the amino terminus of the Gλ-1 heavy chain
cloned into pCD, which reverted the fifth amino acid
back to Val. The correction was introduced via the PCR
overlap technique using the correction primers and
primers annealing to sequences within the CMV promoter
and the CH2 constant region as the outside 5' and 3'
primers, respectively. The final PCR product was
digested with restriction enzymes EcoRI and Bsp120I,
and cloned into the Gλ-1Apcd vector at the same sites to
create Gλ-1Bpcd.
The final construct was sequenced to verify that
the amino terminus of the heavy chain had been corrected
from EVQLLE [SEQ ID NO: 17] to EVQLVE [SEQ ID NO: 18]
(see Fig. 6). The nucleotide sequence of coding region
for the corrected heavy chain, Gλ-1B, is shown in Figs.
10A-10B [SEQ ID NO: 15].

In cloning the variable region of the Gλ-1 light
chain from the scFv format into the full length format,
changes were introduced at the amino terminus for
cloning purposes. Specifically, the first 2 amino acids
(Gln and Ser) of the light chain were deleted and the
third amino acid was changed from Val to Glu. To
correct these changes, PCR primers were designed for the
amino terminus of the Gλ-1 light chain cloned into pCN,
which replaced the two deleted amino acids (Gln and Ser)
and reverted the third amino acid back to Val. The
corrections were introduced via the PCR overlap
technique using the correction primers and primers
annealing to sequences within the CMV promoter and the λ
constant region as the outside 5' and 3' primers,
respectively. The final PCR product was digested with
restriction enzymes EcoRI and AvrII and cloned into the
Gλ-1Apcn vector at the same sites to create Gλ-1Bpcn.
The final construct was sequenced to verify that
the amino terminus of the light chain had been corrected
from --EL to QSVL (amino acids 1-4 of SEQ ID NO: 10).

The nucleotide sequence of coding region for the
corrected light chain, Gλ-1B, is shown in Fig. 11 [SEQ
ID NO: 16]. This vector, Gλ-1Bpcn, was used with
Gλ-1Bpcd to produce antibody Gλ-1B in COS cells and in CHO
cells.
Example 4: Production of Gλ-1 mABs in Mammalian Cells

For initial characterization, the mAb constructs
for each version, Gλ-1A heavy and light chain, Gλ-1B
heavy and light chain, were expressed in COS cells
essentially as described in Current Protocols in
Molecular Biology, eds. F. M. Ausubel et al., 1988, John
Wiley & Sons, vol. 1, section 9.1. On day 1 after the
transfection, the culture growth medium was replaced
with a serum-free medium [SmithKline Beecham] which was
changed on day 3. Similar satisfactory results are
obtained using a publicly available medium, DMEM
supplemented with ITS™ Premix, an insulin, transferrin,
selenium mixture (Collaborative Research, Bedford, MA)
and 1 mg/ml bovine serum albumin (BSA).

The mAb was prepared from the day 3 + day 5
conditioned medium by standard protein A affinity
chromatography methods (e.g., as described in Protocols
in Molecular Biology) using, for example, Prosep A
affinity resin (Bioprocessing Ltd., UK).

To produce larger quantities of the Gλ-1B mAB
(100-200 mg), the vectors were introduced into a proprietary
CHO cell system. However, similar results will be
obtained using dhfr- CHO cells as previously described
[P. Hensley et al., J. Biol. Chem., 269: 23949-23958
(1994)]. Briefly, a total of 30 µg of linearized
plasmid DNA (15 µg each of the A or B set of heavy chain
and light chain vectors) is electroporated into 1x10^7
cells. The cells are initially selected in
nucleoside-free medium in 96 well plates. After three to four
weeks, media from growth positive wells is screened for
human immunoglobulin using an ELISA assay. The highest
expressing colonies are expanded and selected in
increasing concentrations of methotrexate for
amplification of the transfected vectors. The antibody
is purified from conditioned medium by standard
procedures using protein A affinity chromatography
(Protein A sepharose, Pharmacia) followed by size
exclusion chromatography (Superdex 200, Pharmacia).

The concentration and the antigen binding activity
of the eluted antibody are measured by ELISA. The
antibody containing fractions are pooled and further
purified by size exclusion chromatography. As expected
for any such antibody, by SDS-PAGE, the predominant
protein product migrated at approximately 150 kd under
non-reducing conditions and as two bands of 50 and 25 kd
under reducing conditions. For antibody produced in CHO
cells, the purity was > 90%, as judged by SDS-PAGE, and
the concentration was accurately determined by amino
acid analysis.



Example 5: Binding of the Gλ-1 mABs to recombinant F
protein

Binding of the Gλ-1 mABs to recombinant F protein
was measured in a standard solid phase ELISA. Antigen
diluted in PBS pH 7.0 was adsorbed onto polystyrene
round-bottom microplates (Dynatech, Immunolon II) for 18
hours. Wells were then aspirated and blocked with 0.5%
boiled casein (BC) in PBS containing 1% Tween 20
(PBS/0.5% BC) for two hours. Antibodies (50 µl/well)
were diluted to varying concentrations in PBS/0.5% BC
containing 0.025% Tween 20 and incubated in antigen
coated wells for one hour. Plates were washed three
times with PBS containing 0.05% Tween 20, using a
Titertek 320 microplate washer, followed by addition of
HRP-labelled protein A/G (50 µl) diluted 1:5000. After
washing three times, TMBlue substrate (TSI, #TM102) was
added and plates were incubated an additional 15
minutes. The reaction was stopped by addition of 1 N
H2SO4 and absorbance read at 450 nm using a Biotek ELISA
reader.

The antigen binding epitope of the Gλ-1 mABs was
examined in a competition ELISA. The Gλ-1 mABs were
mixed with increasing concentrations of RSMU19 or B4,
two potent neutralizing mAbs [Tempest et al., Biotech.,
9: 266-271 (1991); Kennedy et al., J. Gen. Virol., 69:
3023-3032 (1988)] and added to F protein-coated wells.
The epitope regions recognized by mAbs RSMU19 and B4 are
quite distinct from each other as previously described
in Arbiza et al., J. Gen. Virol., 73: 2225-2234 (1992).
The concentration of the Gλ-1 mABs used in competition
studies was determined previously to give 90% maximal
binding to F antigen. Binding of the Gλ-1 mABs in the
presence of other mABs was detected using HRP-labelled
goat anti-human IgG. The reaction was developed as
stated above.

The Gλ-1 mABs demonstrated potent binding to
recombinant F (rF) protein by ELISA (EC50 for mAB B = 2.6
ng/ml). Binding of the Gλ-1 mABs to rF protein was
inhibited by mAb B4, for which the F protein amino acids
critical for antigen recognition are amino acids 268,
272 and 275 of SEQ ID NO: 20. Binding of the Gλ-1 mABs
to rF protein was not inhibited by mAb RSMU19, for which
F protein amino acid 429 of SEQ ID NO: 20 is critical
for antigen recognition. These results indicate that
residues in the region of amino acids 255-275 of the F
protein [SEQ ID NO: 20] are critical for Gλ-1 mAB
recognition.
Example 6: In vitro Fusion-Inhibition Activity of the
Gλ-1 mABs

The ability of the Gλ-1 mABs to inhibit
virus-induced cell fusion was determined using a modification
of the in vitro microneutralization assay [Beeler et
al., J. Virol., 63: 2941-2950 (1989)]. In this assay, 50
µl of RS Long strain virus (10-100 TCID50/well) [American
Type Culture Collection ATCC VR-26] were mixed with 0.1
ml VERO cells (5x10^3/well) [ATCC CCL-81] in Minimum
Essential Media (MEM) containing 2% fetal calf serum
(FCS), for 4 hours at 37°C, 5% CO2. Serial two-fold
dilutions (in quadruplicate) of mAB (50 µl) were then
added to wells containing virus-infected cells. Control
cultures contained cells incubated with virus only
(positive virus control) or cells incubated with media
alone.

Cultures were incubated at 37°C in 5% CO2 for 6 days
at which time cytopathic effects (CPE) in virus control
wells were > 90%. Microscopic examination for
cytopathic effects was confirmed by ELISA. Media was
aspirated from cultures and replaced with 50 µl of 90%
methanol containing 0.6% H2O2. After 10 minutes,
fixative was aspirated and plates were air dried
overnight. Viral antigen was detected in the fixed
cultures using 1 µg/ml biotinylated RSCHB4 (a human Fc
derivative of the bovine B4 mAb [SmithKline Beecham]),
followed by HRP-labelled streptavidin
(Boehringer-Mannheim) diluted 1:10,000. The reaction was developed
using TMBlue and stopped by addition of 1 N H2SO4.
Absorbance was measured at 450 nm (O.D.450).

Fusion-inhibition titers were defined as the
concentration of antibody which caused a 50% reduction
in ELISA signal (ED50) as compared to virus controls.
Based on the curve generated in the ELISA by the
standard virus titration, a 50% reduction in O.D.450
corresponded to > 90% reduction in virus titer.
Calculation of the 50% point was based on regression
analysis of the dose titration.
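
The text does not specify which regression model was used to locate the 50% point. As one plausible sketch, the Python example below fits a straight line to ELISA signal (expressed as percent of the virus control) against log10 antibody concentration and solves for the concentration giving 50%. The log-linear model, the function names, and the example titration values are all assumptions made for illustration; they are not data from this document.

```python
import math
from typing import Sequence, Tuple


def fit_line(xs: Sequence[float], ys: Sequence[float]) -> Tuple[float, float]:
    """Ordinary least-squares fit of y = intercept + slope * x."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need at least two paired points")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("all concentrations are identical")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def ed50(concs_ug_per_ml: Sequence[float],
         signal_pct_of_control: Sequence[float],
         target_pct: float = 50.0) -> float:
    """Concentration at which the fitted line crosses target_pct of control."""
    xs = [math.log10(c) for c in concs_ug_per_ml]
    slope, intercept = fit_line(xs, signal_pct_of_control)
    if slope == 0:
        raise ValueError("flat dose response; ED50 undefined")
    return 10 ** ((target_pct - intercept) / slope)


# Hypothetical titration, chosen only to exercise the calculation.
EXAMPLE_CONCS = [0.1, 1.0, 10.0]      # ug/ml
EXAMPLE_SIGNAL = [80.0, 50.0, 20.0]   # percent of virus-control O.D.450

if __name__ == "__main__":
    print("ED50 (ug/ml):", ed50(EXAMPLE_CONCS, EXAMPLE_SIGNAL))
```

A four-parameter logistic fit is a common alternative when the titration spans the full sigmoidal range; the straight-line version is kept here because it needs only the standard library.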

The Gλ-1 mABs demonstrated potent in vitro
fusion-inhibition activity against type A RS Long strain virus
(ED50 for mAB B of 0.51 ± 0.38 µg/ml). In this in vitro
fusion-inhibition assay, Gλ-1 mAB B was more active than
the humanized mAB RSHZ19 (ED50 of 0.4-3.0 µg/ml) [Wyde et
al., Pediatr. Res., 38(4): 543-550] in comparative
assays.

Example 7: In vivo Activity of Gλ-1 mAB B: Prophylaxis
and Therapy in Balb/c Mouse Model

Balb/c mice (5/group) were inoculated
intraperitoneally with doses ranging from 0.06 mg/kg to
5 mg/kg of Gλ-1 mAB B either 24 hours prior
(prophylaxis) or 4 days after (therapy) intranasal
infection with 10^5 PFU of the A2 strain of human RSV.
Mice were sacrificed 5 days after infection. Lungs were
harvested and homogenized to determine virus titers.

Virus was undetectable in the lungs of mice treated
with ≥ 1.25 mg/kg Gλ-1 mAB B either
prophylactically or therapeutically. See Table II
below. Significant viral clearance (2-3 log10) was also
achieved in animals receiving 0.31 mg/kg Gλ-1 mAB B
either prophylactically or therapeutically.

Table II: Gλ-1 mAB B Prophylaxis and Therapy in Balb/c
Mice

               Dose       Lung Virus Titer (log10/g lung)
Treatment      (mg/kg)    Prophylaxis     Therapy
---------------------------------------------------------
Gλ-1 mAB B     5          <1.7            <1.7
               1.25       <1.7            <1.7
               0.31       1.8 ± 0.3       2.9 ± 0.4
               0.06       4.3 ± 0.7       4.5 ± 0.3

PBS            -          4.8 ± 0.7       4.7 ± 0.2
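
The log10 reductions implied by Table II can be worked out by subtracting each treated-group titer from the PBS control titer for the same regimen. The Python sketch below does this with the mean values from the table; entries reported as <1.7 are entered at the 1.7 limit, so the reductions for those rows are lower bounds. The data structure and function name are illustrative assumptions.

```python
DETECTION_LIMIT = 1.7  # titers reported as "<1.7" are entered at this value

# Mean lung virus titers (log10/g lung) from Table II.
PBS_CONTROL = {"prophylaxis": 4.8, "therapy": 4.7}
TREATED = {
    5.0:  {"prophylaxis": DETECTION_LIMIT, "therapy": DETECTION_LIMIT},
    1.25: {"prophylaxis": DETECTION_LIMIT, "therapy": DETECTION_LIMIT},
    0.31: {"prophylaxis": 1.8, "therapy": 2.9},
    0.06: {"prophylaxis": 4.3, "therapy": 4.5},
}


def log_reduction(dose_mg_per_kg: float, regimen: str) -> float:
    """log10 reduction in lung titer relative to the PBS control group."""
    return round(PBS_CONTROL[regimen] - TREATED[dose_mg_per_kg][regimen], 1)


if __name__ == "__main__":
    for dose in sorted(TREATED, reverse=True):
        for regimen in ("prophylaxis", "therapy"):
            print(dose, regimen, log_reduction(dose, regimen))
```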



The Gλ-1 mABs have potent antiviral activity
in vitro against a broad range of native RSV isolates of
both type A and B, and show prophylactic and therapeutic
efficacy in vivo in animal models. Thus, the Gλ-1 mABs
are candidates for therapeutic, prophylactic, and
diagnostic application in man.

Numerous modifications and variations of the
present invention may be made by one of skill in the art
in view of the invention described herein. Such
modifications are believed to be encompassed by the
specification and claims of the present invention. All
references cited above are incorporated by reference
herein.
1. A human monoclonal antibody and
functional fragments thereof, specifically reactive with
an F protein epitope of Respiratory Syncytial Virus and
capable of neutralizing infection by said virus selected
from the group consisting of Gλ-1A and Gλ-1B.

2. The monoclonal antibody according to
Claim 1 which comprises the light chain amino acid
sequence of Fig. 3 SEQ ID NO: 2 and the heavy chain
amino acid sequence of Fig. 4 SEQ ID NO: 4.

3. The monoclonal antibody according to
Claim 1 which comprises the light chain amino acid
sequence encoded by the DNA sequence of Fig. 11 SEQ ID
NO: 16 and the heavy chain amino acid sequence encoded
by the DNA sequence of Figs. 10A-10B SEQ ID NO: 15.

4. The monoclonal antibody according to
Claim 1 wherein said fragment is selected from the group
consisting of Fv, Fab and F(ab')2.

5. An isolated nucleic acid molecule
selected from the group consisting of:

(a) a nucleic acid sequence encoding any
of the human monoclonal antibodies, altered antibodies
and CDRs of any of the claims 1-4;

(b) a nucleic acid complementary to any
of the sequences in (a); and

(c) a nucleic acid sequence of 18 or more
nucleotides capable of hybridizing to the CDRs of any of
claims 1-4 under stringent conditions.
6. The isolated nucleic acid molecule
according to Claim 5 comprising the sequences of Figs.
8A-8F and 9A-9E SEQ ID NOS: 13 and 14, or Figs. 10A-10B
and 11 SEQ ID NOS: 15 and 16.

7. A recombinant plasmid comprising the
nucleic acid sequences of any of Claims 5 or 6.

8. A host cell comprising the plasmid of
Claim 7.

9. A process for the production of a human
antibody specific for RSV comprising culturing the host
cell of Claim 8 in a medium under suitable conditions of
time, temperature and pH and recovering the antibody so
produced.

10. A method of detecting RSV comprising
contacting a source suspected of containing RSV with a
diagnostically effective amount of the monoclonal
antibody of Claim 1 and determining whether the
monoclonal antibody binds to the source.

11. A method for providing passive
immunotherapy to RSV disease in a human, comprising
administering to the human an immunotherapeutically
effective amount of the monoclonal antibody of Claim 1.

12. The method according to Claim 11 wherein
the passive immunotherapy is provided prophylactically.

13. A pharmaceutical composition comprising
at least one dose of an immunotherapeutically effective
amount of the monoclonal antibody of Claim 1 in a
pharmaceutically acceptable carrier.

14. A pharmaceutical composition comprising
at least one dose of an immunotherapeutically effective
amount of the monoclonal antibody of Claim 1 in
combination with at least one additional monoclonal
antibody.

15. The pharmaceutical composition according
to Claim 14 wherein said additional monoclonal antibody
is an anti-RSV antibody distinguished from the antibody
of Claim 1 by virtue of being reactive with a different
epitope of the RSV F protein antigen.
Fig. 1A

RSV19/Gλ1 scFv phage competition
[Plot not reproduced. Series: Gλ1; Gλ1/RSV19. X-axis: a.r.u., 0.00E+00 to 1.00E+09.]

Fig. 1B

B4/Gλ1 scFv phage competition
[Plot not reproduced. X-axis: 0.00E+00 to 1.00E+09.]
Fig. 2



Neutralisation of RS/V/273 with phage Fv
[Plot not reproduced. X-axis: a.r.u. (x10^7). Series: G lambda 3, G lambda 1, G kappa 1.]
FIGURE 3

  1 CAGTCTGTGTTGACGCAGCCGCCCTCAGTCTCTGCGGCCCCAGGACAGAA 50
    QSVLTQPPSVSAAPGQK

 51 GGTCACCATCTCCTGCACTGGGAGCAGCTCCAACCTCGGGGCAGGTTATG 100
    VTISCTGSSSNLGAGYD

101 ATGTTCACTGGTACCGGCAACTTCCAGGGACAGCCCCCAAACTCCTCATC 150
    VHWYRQLPGTAPKLLI

151 TATGATAACAACAATCGGCCCTCAGGGGTCCCTGACCGATTCTCTGGCTC 200
    YDNNNRPSGVPDRFSGS

201 CAAGTCTGGCCCCTCAGCCTCCCTGGCCATCTCTGGGCTCCAGGCTGAGG 250
    KSGPSASLAISGLQAED

251 ATGAGGCTGATTATTACTGCCAGTCCTATGACAGCAGCCTGAATGGTTAT 300
    EADYYCQSYDSSLNGY

301 GTCTTCGGAACTGGGACCCAGCTCACCGTCCTAGGT
    VFGTGTQLTVLG
FIGURE 4



  1 GAGGTGCAGCTGGTGGAGTCTGGGGGAGGCTTGGTACAGCCTGGGGGGTC
    EVQLVESGGGLVQPGGS

 51 CCTGAGACTCTCCTGCGCAGCCTCTGGAGTCTCCCTCAGTGGATACAAGA
    LRLSCAASGVSLSGYKM

101 TGAACTGGGTCCGCCAGGCTCCAGGGAAGGGGCTGGAATGGGTCTCTTCC
    NWVRQAPGKGLEWVSS

151 ATTACTGGTATGAGTAATTACATACACTACTCAGACTCAGTGAAGGGCCG
    ITGMSNYIHYSDSVKGR

201 ATTCACCATCTCCAGAGACAACGCCATGAACTCACTGTATCTGCAAATGA
    FTISRDNAMNSLYLQMN

251 ACAGCCTGACAGCCGAGGACACGGGTGTTTATTATTGTGCGACACAACCG
    SLTAEDTGVYYCATQP

301 GGGGAGCTGGCGCCTTTTGACCATTGGGGCCAGGGAACCCTGGTCACCGT
    GELAPFDHWGQGTLVTV

351 CTCCTCA
    SS
Figure 5




[Schematic not reproduced. Recoverable labels, top to bottom:
PCR / PCR; XhoI-BstEII / SacI-AvrII; Cotransfect into
Mammalian Host Cells; Full Length IgG1.]
FIGURE 6

Comparison of the Heavy Chain Amino Acid Sequences of the
Gλ-1 single chain Fv and mAbs

Leader and Variable Regions

GL Dp58:                       EVQLVESGGGLVQPGGSLRLSCAASGFTFS
Gλ-1 scFv:                     --------------------------VSL-
Gλ-1A:      MGWSCIILFLVATATGVHS    L
Gλ-1B:                             V

            CDR1               CDR2
GL Dp58:    SYEMNWVRQAPGKGLEWVSYISSSGSTIYYADSVKGRFTISRDNAKNSLY
Gλ-1 scFv:  G-K----------------S-TGMSNY-H-S--------------M----
Gλ-1A:
Gλ-1B:

                              CDR3
GL Dp58:    LQMNSLRAEDTAVYYCAR
Gλ-1 scFv:  ------T----G-----TQPGELAPFDHWGQGTLVTVSS
Gλ-1A:
Gλ-1B:
FIG 7

Comparison of the Light Chain Amino Acid Sequences of the Gλ-1
single chain Fv and mAbs

Leader and Variable Regions

                                                     CDR1
GL DpL8:                       QSVLTQPPSVSGAPGQRVTISCTGSSSNIG
Gλ-1 scFv:                     -----------A----K-----------L-
Gλ-1A:      MGWSCIILFLVATATGVHS  E
Gλ-1B:                         QSV

                                 CDR2
GL DpL8:    AGYDVHWYQQLPGTAPKLLIYGNSNRPSGVPDRFSGSKSGTSASLAITGL
Gλ-1 scFv:  --------R------------D-N----------------P------S--
Gλ-1A:
Gλ-1B:

                      CDR3
GL DpL8:    QAEDEADYYC
Gλ-1 scFv:  ----------QSYDSSLNGYVFGTGTQLTVLG
Gλ-1A:
Gλ-1B:
FIGURE 8A

   1 gacgtcgcggccgctctaggcctccaaaaaagcctcctcactacttctgg
  51 aatagctcagaggccgaggcggcctcggcctctgcataaataaaaaaaat
 101 tagtcagccatgcatggggcggagaatgggcggaactgggcggagttagg
 151 ggcgggatgggcggagttaggggcgggactatggttgctgactaattgag
 201 atgcatgctttgcatacttctgcctgctggggagcctggggactttccac
 251 acctggttgctgactaattgagatgcatgctttgcatacttctgcctgct
 301 ggggagcctggggactttccacaccctaactgacacacattccacagaat
 351 taattcccggggatcgatccgtcgacgtacgactagttattaatagtaat
 401 caattacggggtcattagttcatagcccatatatggagttccgcgttaca
 451 taacttacggtaaatggcccgcctggctgaccgcccaacgacccccgccc
 501 attgacgtcaataatgacgtatgttcccatagtaacgccaatagggactt
 551 tccattgacgtcaatgggtggactatttacggtaaactgcccacttggca
 601 gtacatcaagtgtatcatatgccaagtacgccccctattgacgtcaatga
 651 cggtaaatggcccgcctggcattatgcccagtacatgaccttatgggact
 701 ttcctacttggcagtacatctacgtattagtcatcgctattaccatggtg
 751 atgcggttttggcagtacatcaatgggcgtggatagcggtttgactcacg
 801 gggatttccaagtctccaccccattgacgtcaatgggagtttgttttggc
 851 accaaaatcaacgggactttccaaaatgtcgtaacaactccgccccattg
 901 acgcaaatgggcggtaggcgtgtacggtgggaggtctatataagcagagc

EcoRI

 951 tgggtacgtgaaccgtcagatcgcctggagacgccatcgaattctgagca

1001 cacaggacctcaccatgggatggagctgtatcatcctcttcttggtagca

M G W S C I I L F L V A
Leader start

XhoI

1051 acagctacaggtgtccactccgaggtccaactgctcgagtctgggggagg

T A T G V H S E V Q L L E S

Processed N-term
FIGURE 8B

1101 cttggtacagcctggggggtccctgagactctcctgcgcagcctctggag
1151 tctccctcagtggatacaagatgaactgggtccgccaggctccagggaag
1201 gggctggaatgggtctcttccattactggtatgagtaattacatacacta
1251 ctcagactcagtgaagggccgattcaccatctccagagacaacgccatga
1301 actcactgtatctgcaaatgaacagcctgacagccgaggacacgggtgtt
1351 tattattgtgcgacacaaccgggggagctggcgccttttgaccattgggg

BstEII Bsp120I

1401 ccagggaaccctggtcaccgtctcctcagcctccaccaagggcccatcgg

Q G T L V T V S S /

Framework IV / CH1

1451 tcttccccctggcaccctcctccaagagcacctctgggggcacagcggcc
1501 ctgggctgcctggtcaaggactacttccccgaaccggtgacggtgtcgtg
1551 gaactcaggcgccctgaccagcggcgtgcacaccttcccggctgtcctac

BstEII

1601 agtcctcaggactctactccctcagcagcgtggtgaccgtgccctccagc
1651 agcttgggcacccagacctacatctgcaacgtgaatcacaagcccagcaa
1701 caccaaggtggacaagaaagttgagcccaaatcttgtgacaaaactcaca
1751 catgcccaccgtgcccagcacctgaactcctggggggaccgtcagtcttc
1801 ctcttccccccaaaacccaaggacaccctcatgatctcccggacccctga
1851 ggtcacatgcgtggtggtggacgtgagccacgaagaccctgaggtcaagt
1901 tcaactggtacgtggacggcgtggaggtgcataatgccaagacaaagccg
1951 cgggaggagcagtacaacagcacgtaccgggtggtcagcgtcctcaccgt
2001 cctgcaccaggactggctgaatggcaaggagtacaagtgcaaggtctcca
2051 acaaagccctcccagcccccatcgagaaaaccatctccaaagccaaaggg
2101 cagccccgagaaccacaggtgtacaccctgcccccatcccgggatgagct
2151 gaccaagaaccaggtcagcctgacctgcctggtcaaaggcttctatccca
FIGURE 8C

22 01 gcgacatcgccgtggagtgggagagcaatgggcagccggagaacaac tac 

22 51 aagaccacgcc tcccgtgctggac tccgacggc tec t tc t tec tc tacag 
2 3 01 caagc tcaccgtggacaagagcaggtggcagcaggggaacg tc t tc teat 

23 51 gc tccgtgatgcatgaggc tc tgcacaaccac tacacgcagaagagcc tc 

2 4 01 tccc tgtc teegggtaaa tga tagatate tacgtatgatcagcc tcgac t 
S P G K * C-term of heavy chain 

2 451 gtgee t tc tagttgccagccatc tgttgtttgcccc tcccccgtgcc ttc 

2 5 01 c ttgaccc tggaaggtgccac tcccac tgtcc t t tec taataaaatgagg 

2 551 aaattgeategcat tgtc tgagtaggtgtcattc tat tc tggggggtggg 

2 6 01 gtggggcaggacagcaagggggaggattgggaagacaatagcaggcatgc 

2 651 tggggatgcggtgggc tc tatggaac cage tggggc tcgac age get gga 

2 7 01 tctcccgatccccagctttgcttctcaatttc ttatttgcataatgagaa 

2 751 aaaaaggaaaattaat tttaacaccaat tcagtagttgattgagcaaatg 

2 801 cgttgccaaaaaggatgc tttagagacagtgttc tetgeacagataagga 

2 851 caaacat tat tcagagggagtacccagagc tgagac tec taagccagtga 

2 901 gtggcacagcattc tagggagaaatatgc t tgtcatcaccgaagcc tgat 

2 951 tccgtagagccacacc ttggtaagggccaatctgctcacacaggatagag 

3 001 agggcaggagccagggcagagcatataaggtgaggtaggatcagttgctc 
3 051 c tcacat ttgcttctgacatagttgtgt tgggagc ttggatagcttggac 
3101 agctcagggc tgcgatttcgcgccaaacttgacggcaatcctagcgtgaa 
3151 ggc tggtaggatt t tat ccccgc tgeca tea tggt tcgac cat tgaactg 
3 2 01 catcgtcgccgtgtcccaaaatatggggat tggcaagaacggagacctac 
3 2 51 cctggcc teege tcaggaacgagt tcaagtac t t ccaaagaatgaccaca 
3 3 01 acctcttcagtggaaggtaaacagaatctggtgattatgggtaggaaaac 
3 3 51 c tggt tctccat tec tgagaagaatcgacctttaaaggacagaattaata 
FIGURE 8D

3 401 tagt tc tcagtagagaac tcaaagaaccaccacgaggagc tcattttc 1 1 

3 451 gccaaaagt t tggatgatgcc ttaagac t tat tgaacaaccggaattggc 

3 501 aagtaaagtagacatggt ttggatagtcggaggcagt tc tgt t taccagg 

3 551 aagccatgaatcaaccaggccacc ttagac tc 1 1 tgtgacaaggatcatg 

3 6 01 caggaat ttgaaagtgacacgttt t tcccagaaat tgat t tggggaaata 

3 651 taaac t tc tcccagaatacccaggcgtcc tc tc tgaggtccaggaggaaa 

37 01 aaggcatcaagtataagtttgaagtctacgagaagaaagactaacaggaa 

3 751 gatgctttcaagttc tc tgc tcccc tec taaagc tatgeatt t ttataag 

3 801 accatgggactt ttgctggct t tagatcagee t cgac tgtgcc t tc tagt 

3 851 tgccagccatc tgttgtttgcccctcccccgtgcc t tec t tgaccc tgga 

3 901 aggtgecac tcccac tgtcctt tec taataaaatgaggaaat tgeatege 

3 951 attgtc tgagtaggtgtcattc tattc tggggggtggggtggggcaggac 

40 01 agcaagggggaggattgggaagacaatagcaggcatgctggggatgcggt 

4051 gggc tc tatggaaccagc tggggc tcgatcgagtgtatgac tgcggccgc 

4101 gatcccgtcgagagcttggcgtaatcatggtcatagctgtttcctgtgtg 

4151 aaattgttatccgc tcacaat tccacacaacatacgagccggaagcataa 

42 01 agtgtaaagcctggggtgcctaatgagtgagctaactcacattaattgcg 

42 51 ttgege tcac tgcccgctttccagtcgggaaacc tgtcgtgccagctgca 

43 01 ttaatgaatcggccaacgcgcggggagaggcggt ttgcgtattgggcgct 

43 51 cttccgcttcc tege tcac tgac tcgctgcgct egg tcgttcggctgcgg 

44 01 cgagcggtatcagc tcac tcaaaggcggtaatacggttatccacagaatc 
4451 aggggataaegcaggaaagaaca tgtgagcaaaaggccagcaaaaggcca 

45 01 ggaaccgtaaaaaggccgcgttgctggcgtttttccataggctccgcccc 
4551 cc tgacgagcatcacaaaaatcgacgctcaagtcagaggtggcgaaaccc 

46 01 gacaggac tataaagataccaggegt ttccccctggaagctccctcgtgc 
FIGURE 8E

4651 gc tctcctgttccgaccc tgccgc t taccggatacc tgtccgcc tttc tc 

47 01 cc ttcgggaagcgtggcgctttc tcaatgc tcacgc tgtaggtatc tcag 

47 51 ttcggtgtaggtcgttcgc tccaagc tgggc tgtgtgcacgaaccccccg 

48 01 ttcagcccgaccgc tgcgcc ttatccggtaac tatcgtc ttgagtccaac 
4851 ccggtaagacacgac ttatcgccac tggcagcagccac tggtaacaggat 

49 01 tagcagagcgaggtatgtaggcggtgctacagagttcttgaagtggtggc 
4951 ctaac tacggctacactagaaggacagtatttggtatc tgcgc tctgctg 
5 0 01 aagccagt tacct tcggaaaaagagttggt age tctt gate cggcaaaca 
5051 aaccaccgc tggtagcggtggtt tt t t tgtt tgcaagcagcagat tacgc 
5101 gcagaaaaaaaggatctcaagaagatcctttgatcttttctacggggtct 
5151 gaege tcagtggaacgaaaac teaegttaagggat tttggtcatgagatt 

52 01 atcaaaaaggatc ttcacc tagatcc tt t taaattaaaaatgaagtt tta 
5251 aatcaatc taaagtatatatgagtaaacttggtctgacagttaccaatgc 

53 01 ttaatcagtgaggcacctatctcagcgatctgtctatttcgttcatccat 

53 51 agttgcctgactccccgtcgtgtagataac tacgataegggaggget tac 

54 01 catc tggccccagtgc tgcaatgataccgcgagacccacgc tcaccggct 
5451 ccagatttatcagcaataaaccagccagccggaagggccgagcgcagaag 
5501 tggtcc tgcaactttatccgcc tccatccagtc tattaattgttgccggg 
5551 aagctagagtaagtagttcgccagttaatagtttgcgcaacgttgttgcc 
5 601 attgctacaggcatcgtggtgtcacgctcgtcgtttggtatggcttcatt 
5 651 cagctccggttcccaacgatcaaggcgagttacatgatcccccatgttgt 
5701 gcaaaaaagcggttagctccttcggtcctccgatcgttgtcagaagtaag 
5751 t tggccgcagtgt tatcac tcatggttatggcagcactgcataattctct 
5801 tac tgtcatgccatccgtaagatgc t tttc tgtgac tggtgagtac tcaa 
FIGURE 8F

58 51 ccaagtcattc tgagaatagtgtatgcggcgaccgagt tgc tcttgcccg 

59 01 gcgtcaatacgggataataccgcgccacatagcagaac tttaaaagtgc t 

59 51 catcattggaaaacgttc tt cggggcgaaaac tctcaaggatct taccgc 

60 01 tgttgagatccagttcgatgtaacccactcgtgcacccaac tgatcttca 
6051 gcatc ttttac tt tcaccagcgt ttc tgggtgagcaaaaacaggaaggca 
6101 aaatgccgcaaaaaagggaataagggcgacacggaaatgttgaatac tea 
6151 tactc ttcctt tttcaatat tat tgaagcatt tatcagggt tat tgtc tc 
62 01 atgageggatacatatt tgaatgtatttagaaaaataaacaaataggggt 
62 51 tccgcgcacat ttccccgaaaagtgccacc t 
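
The restriction sites annotated on this plasmid sequence (for example EcoRI above row 951) can be located with a plain substring search. The sketch below is illustrative only: `find_site` is a hypothetical helper, the 50-nucleotide string is row 951 of Figure 8A with the spacing removed, and GAATTC is the standard EcoRI recognition sequence.

```python
from typing import List


def find_site(sequence: str, site: str, first_position: int = 1) -> List[int]:
    """Return the start position of every occurrence of `site`.

    `first_position` is the coordinate of the first nucleotide in `sequence`,
    so a row excerpted from a numbered figure keeps its plasmid numbering.
    """
    seq, motif = sequence.lower(), site.lower()
    hits = []
    start = seq.find(motif)
    while start != -1:
        hits.append(first_position + start)
        start = seq.find(motif, start + 1)  # allow overlapping matches
    return hits


# Row 951 of Figure 8A (nucleotides 951-1000), spacing removed.
ROW_951 = "tgggtacgtgaaccgtcagatcgcctggagacgccatcgaattctgagca"
ECORI = "GAATTC"

if __name__ == "__main__":
    print(find_site(ROW_951, ECORI, first_position=951))
```

On this row the search reports position 989, which agrees with the 12-nucleotide segment `gaattc tgagca` that ends at position 1000 in Figure 10A.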
FIGURE 9A

1 gacgtcgcggccgctctaggcctccaaaaaagcctcctcactacttctgg
51 aatagctcagaggccgaggcggcctcggcctctgcataaataaaaaaaat
101 tagtcagccatgcatggggcggagaatgggcggaactgggcggagttagg
151 ggcgggatgggcggagttaggggcgggactatggttgctgactaattgag
201 atgcatgctttgcatacttctgcctgctggggagcctggggactttccac
251 acctggttgctgactaattgagatgcatgctttgcatacttctgcctgct
301 ggggagcctggggactttccacaccctaactgacacacattccacagaat
351 taattcccggggatcgatccgtcgacgtacgactagttattaatagtaat
401 caattacggggtcattagttcatagcccatatatggagttccgcgttaca
451 taacttacggtaaatggcccgcctggctgaccgcccaacgacccccgccc
501 attgacgtcaataatgacgtatgttcccatagtaacgccaatagggactt
551 tccattgacgtcaatgggtggactatttacggtaaactgcccacttggca
601 gtacatcaagtgtatcatatgccaagtacgccccctattgacgtcaatga
651 cggtaaatggcccgcctggcattatgcccagtacatgaccttatgggact
701 ttcctacttggcagtacatctacgtattagtcatcgctattaccatggtg
751 atgcggttttggcagtacatcaatgggcgtggatagcggtttgactcacg
801 gggatttccaagtctccaccccattgacgtcaatgggagtttgttttggc
851 accaaaatcaacgggactttccaaaatgtcgtaacaactccgccccattg
901 acgcaaatgggcggtaggcgtgtacggtgggaggtctatataagcagagc

EcoRI

951 tgggtacgtgaaccgtcagatcgcctggagacgccatc gaattc tgagca

1001 cacaggacctcaccatgggatggagctgtatcatcctcttcttggtagca

MGWSCIILFLVA
Leader start

SacI

1051 acagctacaggtgtccactcc gagctc acgcagccgccctcagtctctgc
T A T G V H S E L T Q --

Processed N-term
FIGURE 9B

1101 ggccc caggacagaaggt caeca tc tec tgcac tgggagcagc tccaacc 
1151 tcggggcaggttatgatgttcac tggtaccggcaac ttccagggacagcc 
12 01 cccaaac tec teat eta tgataacaacaatcggccc tcaggggtccc tga 

12 51 ccgattctctggctccaagtctggcccctcagcctccctggccatctctg 

13 01 ggctccaggctgaggatgaggctgattattac tgccagtcc tatgacagc 

AvrII

13 51 agee tgaatggttatgtc tteggaac tgggacccagc tcaccgt cctagg 

T Q L T V L G 
Framework IV / Cλ

14 01 tcagcccaaggctgccccctcggtcactc tgt tcccgccc tec tc tgagg 

14 51 agct tcaagccaacaaggccacac tggtgtgtc tcataagtgac ttc tac 

15 01 ccgggagccgtgacagtggcc tggaaggcaat tagcagccccgtcaaggc 
1551 gggagtggagaccaccacaccc tccaaacaaagcaacaacaagtacgcgg 

16 01 ccagcagctatctgagcctgacgcctgagcagtggaagtcccacagaagg 
1651 tacagctgccaggtcacgcatgaagggagcaccgtggagaagacagtggc 

17 01 ccctacagaa tgt tea t ag ttctagatctacgtatgatcagcctcgactg 

P T E C S * C-term light chain 

17 51 tgccttctagttgccagccatctgttgtttgcccc tcccccgtgcct tec 

18 01 ttgaccctggaaggtgccactcccac tgtcctttcctaataaaatgagga 
1851 aattgcatcgcattgtc tgagtaggtgtcattc tattc tggggggtgggg 

19 01 tggggcaggacagcaagggggaggattgggaagacaatagcaggcatgct 
1951 ggggatgcggtgggctctatggaaccagctggggctcgacagctcgagct 
2 001 agctttgcttctcaatttcttatttgcataatgagaaaaaaaggaaaatt 
2 051 aatt ttaacaccaattcag tag ttgatt gage aaatgcgttgc caaaaag 
2101 gatgc tttagagacagtgt tc tc tgcacagataaggacaaacat tat tea 
2151 gagggagtacccagagctgagactcctaagccagtgagtggcacagcatt 
FIGURE 9C

22 01 c tagggagaaatatgc ttgtcatcaccgaagcc tgat tccgtagagccac 

22 51 acc t tggtaagggccaatc tgc tcacacaggatagagagggcaggagcca 

23 01 gggcagagcatataaggtgaggtaggatcagttgctcctcacatttgctt 
2 3 51 c tgacatagttgtgt tgggagc t tggatcgatccaccatggttgaacaag 
2 4 01 atggat tgcacgcaggt tc tccggccgct tgggtggagaggc tattcggc 
2 451 tatgactgggcacaacagacaatcggctgc tc tgatgccgccgtgttccg 
2 501 gc tgtcagcgcaggggcgcccggttc t tt t tgtcaagaccgacctgtccg 
2 551 gtgccc tgaatgaac tgcaggacgaggcagcgcggc tatcgtggc tggcc 
2 601 acgacgggcgttccttgcgcagc tgtgc tcgacgt tgtcac tgaagcggg 
2 651 aagggac tggc tgc tattgggcgaagtgccggggcaggatc tcctgtcat 
2 701 c tcacc t tgc tec tgc cgagaaagtatcca tea tggc tgatgeaatgegg 
2 7 51 egge tgeataege ttgatcegge tacc tgc c cat tcgaccaccaagcgaa 
2 801 acatcgcatcgagcgagcacgtac tcggatggaagccggtc ttgtcgatc 
2 851 aggatgatc tggacgaagagcatcaggggc tcgcgccagccgaac tgttc 
2 9 01 gecagge tcaaggcgcgcatgcccgacggcgaggatc tcgtcgtgaccca 

2 951 tggegatgee tgc ttgccgaatatcatggtggaaaatggccgc ttt tc tg 

3 001 gattcatcgactgtggccggctgggtgtggcggaccgctatcaggacata 
3 051 gcgttggc taccegtgatat tgc tgaagagc ttggcggcgaatgggc tga 
3101 cege ttcctcgtgctttacggtatcgccgctcccgattcgcagcgcatcg 
3151 cc ttc tatcgee t tct tgacgagt tc ttc tgagegggae t ctggggttcg 

32 01 aaatgaccgaccaagcgacgcccaacctgccatcacgagatttcgattcc 
3251 accgccgccttctatgaaaggt tgggct tcggaatcgttt tccgggacgc 

33 01 egge tggatgatcctccagcgcggggatc tea tgc tggagttc ttcgccc 
33 51 accccaac ttgt ttattgeage t tataatggttacaaataaagcaatagc 
FIGURE 9D

34 01 atcacaaatttcacaaataaagcatttttttcac tgcat tc tagttgtgg 

3 451 t ttgtccaaac tcatcaatgtatc ttatcatgtc tggatcgcggccgcga 

3 5 01 tcccgtcgagagc ttggcgtaatcatggtcatagc tgtttcc tgtgtgaa 

3 551 attgt tatccgc tcacaat tccacacaacatacgagccggaagcataaag 

3 6 01 tgtaaagcc tggggtgcc t aat gag t gage taac tcacat taattgcgtt 

3 651 gege tcac t gee cgett tccagtcgggaaacc tgtcgt gc cage tgcat t 

3 7 01 aatgaatcggccaacgcgcggggagaggcggt t tgegtat tgggegetc t 

3 7 51 teege t tec tege tcac tgactege tgege t egg teg ttegge tgcggcg 

3 8 01 ageggt at cage tcac tcaaaggcggtaatacggttatccacagaatcag 

3 851 gggataacgcaggaaagaacatgtgagcaaaaggccagcaaaaggccagg 

3 9 01 aaccgtaaaaaggccgcgt tgc tggcgtt t ttccataggc tccgcccccc 

3 9 51 tgacgagcatcacaaaaatcgacgctcaagtcagaggtggcgaaacccga 

40 01 caggactataaagataccaggcgtttccccctggaagctccctcgtgcgc 

4051 tc tec tgttccgaccctgccgcttaccggatacctgtccgcctttctccc 

4101 t tcgggaagcgtggcgc tttc tcaatgctcacgc tgtaggtatc tcagtt 

4151 cggtgtaggtcgttcgc tccaagctgggc tgtgtgcacgaaccccccgtt 

42 01 cagcccgaccgctgcgccttatccggtaactatcgtcttgagtccaaccc 

42 51 ggtaagacacgac t tatcgccac tggcagcagccac tggtaacaggatta 

43 01 gcagagcgaggtatgtaggcggtgc tacagagttc ttgaagtggtggcc t 

43 51 aac taegge tacac tagaaggacagtat t tggtatc tgege tc tgctgaa 

44 01 gccagttaccttcggaaaaagagttggtagctcttgatccggcaaacaaa 
4451 ccaccgctggtagcggtggtttttttgtttgcaagcagcagattacgcgc 
4501 agaaaaaaaggatc tcaagaagatcc tttgatc tt ttctacggggtc tga 
4551 cgctcagtggaacgaaaactcacgttaagggattt tggtcatgagattat 
FIGURE 9E

46 01 caaaaaggatc ttcacc tagatccttttaaattaaaaatgaagttttaaa 

46 51 tcaatctaaagtatatatgagtaaacttggtctgacagttaccaatgctt 

47 01 aatcagtgaggcacctatctcagcgatctgtc tat ttcgt teat cca tag 

47 51 t tgee tgactccccgtcgtgtagataactacgatacgggagggc t tacca 

48 01 tc tggccccagtgc tgcaatgataccgcgagacccacgctcaccggc tec 

48 51 agatt tatcagcaataaaccagccagccggaagggccgagcgcagaagtg 

49 01 gtcctgcaactttatccgcctccatccagtctattaattgttgccgggaa 

49 51 gc tagagtaagtagttcgccagt taatagtt tgcgcaacgt tgt tgecat 

50 01 tgctacaggcatcgtggtgtcacgctcgtcgtttggtatggcttcattca 
50 51 gc tccggttcccaacgatcaaggcgagttacatgatcccccatgttgtgc 
5101 aaaaaagcggttagctccttcggtcc tecgategt tgtcagaagtaagt t 
5151 ggccgcagtgttatcac tcatggttatggcagcac tgeataatte tc tta 
52 01 ctgtcatgccatccgtaagatgc ttttctgtgactggtgagtactcaacc 

52 51 aagtcattctgagaatagtgtatgcggcgaccgagttgctc ttgcccggc 

53 01 gtcaatacgggataataccgcgccacatagcagaactttaaaagtgc tea 

53 51 tcattggaaaacgttcttcggggcgaaaactc tcaaggatc ttaccgctg 

54 01 ttgagatccagttcgatgtaacccac tcgtgcacccaac tgatc ttcagc 
5451 atcttttactt tcaccagcgtttctgggtgagcaaaaacaggaaggcaaa 

55 01 atgccgcaaaaaagggaataagggcgacacggaaatgt tgaatac tcata 
55 51 ctcttcctttttcaatattattgaagcatttatcagggttattgtctcat 
5601 gagcggatacatatttgaatgtatttagaaaaataaacaaataggggttc 
5651 cgcgcacatttccccgaaaagtgccacct 
FIGURE 10A

EcoRI

gaattc tgagca 1000

cacaggacctcacc atg ggatggagctgtatcatcctcttcttggtagca 1050

MGWSCIILFLVA

acagctacaggtgtccactccgaggtgcagctggtggagtctgggggagg 1100
T A T G V H S E V Q L V E S -

N-term

cttggtacagcctggggggtccctgagactctcctgcgcagcctctggag 1150
tctccctcagtggatacaagatgaactgggtccgccaggctccagggaag 1200
gggctggaatgggtctcttccattactggtatgagtaattacatacacta 1250
ctcagactcagtgaagggccgattcaccatctccagagacaacgccatga 1300
actcactgtatctgcaaatgaacagcctgacagccgaggacacgggtgtt 1350
tattattgtgcgacacaaccgggggagctggcgccttttgaccattgggg 1400

Bsp120I

ccagggaaccctggtcaccgtctcctcagcctccaccaa gggccc atcgg 1450
tcttccccctggcaccctcctccaagagcacctctgggggcacagcggcc 1500
ctgggctgcctggtcaaggactacttccccgaaccggtgacggtgtcgtg 1550
gaactcaggcgccctgaccagcggcgtgcacaccttcccggctgtcctac 1600
agtcctcaggactctactccctcagcagcgtggtgaccgtgccctccagc 1650
agcttgggcacccagacctacatctgcaacgtgaatcacaagcccagcaa 1700
caccaaggtggacaagaaagttgagcccaaatcttgtgacaaaactcaca 1750
catgcccaccgtgcccagcacctgaactcctggggggaccgtcagtcttc 1800
ctcttccccccaaaacccaaggacaccctcatgatctcccggacccctga 1850
ggtcacatgcgtggtggtggacgtgagccacgaagaccctgaggtcaagt 1900
tcaactggtacgtggacggcgtggaggtgcataatgccaagacaaagccg 1950
cgggaggagcagtacaacagcacgtaccgggtggtcagcgtcctcaccgt 2000
cctgcaccaggactggctgaatggcaaggagtacaagtgcaaggtctcca 2050
FIGURE 10B

acaaagccctcccagcccccatcgagaaaaccatctccaaagccaaaggg 2100
cagccccgagaaccacaggtgtacaccctgcccccatcccgggatgagct 2150
gaccaagaaccaggtcagcctgacctgcctggtcaaaggcttctatccca 2200
gcgacatcgccgtggagtgggagagcaatgggcagccggagaacaactac 2250
aagaccacgcctcccgtgctggactccgacggctccttcttcctctacag 2300
caagctcaccgtggacaagagcaggtggcagcaggggaacgtcttctcat 2350
gctccgtgatgcatgaggctctgcacaaccactacacgcagaagagcctc 2400
tccctgtctccgggtaaa tga tagatatct
S P G K *
FIGURE 11

EcoRI

gaattc tgagca 1000

cacaggacctcacc atg ggatggagctgtatcatcctcttcttggtagca 1050

MGWSCIILFLVA

acagctacaggtgtccactcccagtctgtgttgacgcagccgccctcagt 1100
T A T G V H S Q S V L T Q -

N-term

ctctgcggccccaggacagaaggtcaccatctcctgcactgggagcagct 1150
ccaacctcggggcaggttatgatgttcactggtaccggcaacttccaggg 1200
acagcccccaaactcctcatctatgataacaacaatcggccctcaggggt 1250
ccctgaccgattctctggctccaagtctggcccctcagcctccctggcca 1300
tctctgggctccaggctgaggatgaggctgattattactgccagtcctat 1350
gacagcagcctgaatggttatgtcttcggaactgggacccagctcaccgt 1400
AvrII

cctaggtcagcccaaggctgccccctcggtcactctgttcccgccctcct 1450
ctgaggagcttcaagccaacaaggccacactggtgtgtctcataagtgac 1500
ttctacccgggagccgtgacagtggcctggaaggcaattagcagccccgt 1550
caaggcgggagtggagaccaccacaccctccaaacaaagcaacaacaagt 1600
acgcggccagcagctatctgagcctgacgcctgagcagtggaagtcccac 1650
agaaggtacagctgccaggtcacgcatgaagggagcaccgtggagaagac 1700
agtggcccctacagaatgttca tag ttctagatctacgtatgatcagcct 1750
P T E C S *
(1) GENERAL INFORMATION:

(i) APPLICANT: SmithKline Beecham, PLC
(ii) TITLE OF INVENTION: Human Monoclonal Antibody
(iii) NUMBER OF SEQUENCES: 21

(iv) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: SmithKline Beecham Corporation
(B) STREET: 709 Swedeland Road
(C) CITY: King of Prussia
(D) STATE: PA
(E) COUNTRY: USA
(F) ZIP: 19406-2799

(v) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy disk
(B) COMPUTER: IBM PC compatible
(C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: PatentIn Release #1.0, Version #1.30

(vi) CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER: GB
(B) FILING DATE:
(C) CLASSIFICATION:

(viii) ATTORNEY/AGENT INFORMATION:
(A) NAME: King, William T.
(B) REGISTRATION NUMBER: 30,954
(C) REFERENCE/DOCKET NUMBER: #

(ix) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: 610-270-4800
(B) TELEFAX: 610-270-4026



(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 336 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..336

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

CAG TCT GTG TTG ACG CAG CCG CCC TCA GTC TCT GCG GCC CCA GGA CAG 48
Gln Ser Val Leu Thr Gln Pro Pro Ser Val Ser Ala Ala Pro Gly Gln
1 5 10 15

AAG GTC ACC ATC TCC TGC ACT GGG AGC AGC TCC AAC CTC GGG GCA GGT 96
Lys Val Thr Ile Ser Cys Thr Gly Ser Ser Ser Asn Leu Gly Ala Gly
20 25 30

TAT GAT GTT CAC TGG TAC CGG CAA CTT CCA GGG ACA GCC CCC AAA CTC 144
Tyr Asp Val His Trp Tyr Arg Gln Leu Pro Gly Thr Ala Pro Lys Leu
35 40 45

CTC ATC TAT GAT AAC AAC AAT CGG CCC TCA GGG GTC CCT GAC CGA TTC 192
Leu Ile Tyr Asp Asn Asn Asn Arg Pro Ser Gly Val Pro Asp Arg Phe
50 55 60

TCT GGC TCC AAG TCT GGC CCC TCA GCC TCC CTG GCC ATC TCT GGG CTC 240
Ser Gly Ser Lys Ser Gly Pro Ser Ala Ser Leu Ala Ile Ser Gly Leu
65 70 75 80

CAG GCT GAG GAT GAG GCT GAT TAT TAC TGC CAG TCC TAT GAC AGC AGC 288
Gln Ala Glu Asp Glu Ala Asp Tyr Tyr Cys Gln Ser Tyr Asp Ser Ser
85 90 95

CTG AAT GGT TAT GTC TTC GGA ACT GGG ACC CAG CTC ACC GTC CTA GGT 336
Leu Asn Gly Tyr Val Phe Gly Thr Gly Thr Gln Leu Thr Val Leu Gly
100 105 110
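
The codon and residue pairing in SEQ ID NO: 1 follows the standard genetic code, and the stated length of 336 base pairs corresponds to 112 codons. As a rough check, the Python sketch below translates the first sixteen codons listed above. The names `CODON_TABLE`, `THREE_LETTER`, and `translate` are hypothetical and are not part of the sequence listing.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# One-letter to three-letter residue codes ("Ter" marks a stop codon).
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "Ter",
}


def translate(dna: str) -> str:
    """Translate a coding sequence (whitespace ignored) to one-letter residues."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First row of SEQ ID NO: 1 (nucleotides 1-48).
FIRST_ROW = "CAG TCT GTG TTG ACG CAG CCG CCC TCA GTC TCT GCG GCC CCA GGA CAG"

if __name__ == "__main__":
    protein = translate(FIRST_ROW)
    print(protein)
    print(" ".join(THREE_LETTER[aa] for aa in protein))
    print(336 // 3)  # number of codons implied by the stated CDS length
```

The same function can be applied to the remaining rows to confirm that each printed residue line agrees with its codon line.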

(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 112 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Gln Ser Val Leu Thr Gln Pro Pro Ser Val Ser Ala Ala Pro Gly Gln
1 5 10 15

Lys Val Thr Ile Ser Cys Thr Gly Ser Ser Ser Asn Leu Gly Ala Gly
20 25 30

Tyr Asp Val His Trp Tyr Arg Gln Leu Pro Gly Thr Ala Pro Lys Leu
35 40 45

Leu Ile Tyr Asp Asn Asn Asn Arg Pro Ser Gly Val Pro Asp Arg Phe
50 55 60

Ser Gly Ser Lys Ser Gly Pro Ser Ala Ser Leu Ala Ile Ser Gly Leu
65 70 75 80

Gln Ala Glu Asp Glu Ala Asp Tyr Tyr Cys Gln Ser Tyr Asp Ser Ser
85 90 95

Leu Asn Gly Tyr Val Phe Gly Thr Gly Thr Gln Leu Thr Val Leu Gly
100 105 110



(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 357 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: 1..357

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

GAG GTG CAG CTG GTG GAG TCT GGG GGA GGC TTG GTA CAG CCT GGG GGG 48
Glu Val Gln Leu Val Glu Ser Gly Gly Gly Leu Val Gln Pro Gly Gly
1 5 10 15

TCC CTG AGA CTC TCC TGC GCA GCC TCT GGA GTC TCC CTC AGT GGA TAC 96
Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Val Ser Leu Ser Gly Tyr
20 25 30

AAG ATG AAC TGG GTC CGC CAG GCT CCA GGG AAG GGG CTG GAA TGG GTC 144
Lys Met Asn Trp Val Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Val
35 40 45

TCT TCC ATT ACT GGT ATG AGT AAT TAC ATA CAC TAC TCA GAC TCA GTG 192
Ser Ser Ile Thr Gly Met Ser Asn Tyr Ile His Tyr Ser Asp Ser Val
50 55 60

AAG GGC CGA TTC ACC ATC TCC AGA GAC AAC GCC ATG AAC TCA CTG TAT 240
Lys Gly Arg Phe Thr Ile Ser Arg Asp Asn Ala Met Asn Ser Leu Tyr
65 70 75 80

CTG CAA ATG AAC AGC CTG ACA GCC GAG GAC ACG GGT GTT TAT TAT TGT 288
Leu Gln Met Asn Ser Leu Thr Ala Glu Asp Thr Gly Val Tyr Tyr Cys
85 90 95

GCG ACA CAA CCG GGG GAG CTG GCG CCT TTT GAC CAT TGG GGC CAG GGA 336
Ala Thr Gln Pro Gly Glu Leu Ala Pro Phe Asp His Trp Gly Gln Gly
100 105 110

ACC CTG GTC ACC GTC TCC TCA 357
Thr Leu Val Thr Val Ser Ser
115



(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 119 amino acids
(B) TYPE: amino acid
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Glu Val Gln Leu Val Glu Ser Gly Gly Gly Leu Val Gln Pro Gly Gly
1 5 10 15

Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Val Ser Leu Ser Gly Tyr
20 25 30

Lys Met Asn Trp Val Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Val
35 40 45

Ser Ser Ile Thr Gly Met Ser Asn Tyr Ile His Tyr Ser Asp Ser Val
50 55 60

Lys Gly Arg Phe Thr Ile Ser Arg Asp Asn Ala Met Asn Ser Leu Tyr
65 70 75 80

Leu Gln Met Asn Ser Leu Thr Ala Glu Asp Thr Gly Val Tyr Tyr Cys
85 90 95

Ala Thr Gln Pro Gly Glu Leu Ala Pro Phe Asp His Trp Gly Gln Gly
100 105 110

Thr Leu Val Thr Val Ser Ser
115

(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 119 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

Glu Val Gln Leu Val Glu Ser Gly Gly Gly Leu Val Gln Pro Gly Gly
1 5 10 15

Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Val Ser Leu Ser Gly Tyr
20 25 30

Lys Met Asn Trp Val Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Val
35 40 45

Ser Ser Ile Thr Gly Met Ser Asn Tyr Ile His Tyr Ser Asp Ser Val
50 55 60

Lys Gly Arg Phe Thr Ile Ser Arg Asp Asn Ala Met Asn Ser Leu Tyr
65 70 75 80

Leu Gln Met Asn Ser Leu Thr Ala Glu Asp Thr Gly Val Tyr Tyr Cys
85 90 95

Ala Thr Gln Pro Gly Glu Leu Ala Pro Phe Asp His Trp Gly Gln Gly
100 105 110

Thr Leu Val Thr Val Ser Ser
115



(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 98 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

Glu Val Gln Leu Val Glu Ser Gly Gly Gly Leu Val Gln Pro Gly Gly
1 5 10 15

Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Phe Thr Phe Ser Ser Tyr
20 25 30

Glu Met Asn Trp Val Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Val
35 40 45

Ser Tyr Ile Ser Ser Ser Gly Ser Thr Ile Tyr Tyr Ala Asp Ser Val
50 55 60

Lys Gly Arg Phe Thr Ile Ser Arg Asp Asn Ala Lys Asn Ser Leu Tyr
65 70 75 80

Leu Gln Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys
85 90 95

Ala Arg



(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 138 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

Met Gly Trp Ser Cys Ile Ile Leu Phe Leu Val Ala Thr Ala Thr Gly
1 5 10 15

Val His Ser Glu Val Gln Leu Leu Glu Ser Gly Gly Gly Leu Val Gln
20 25 30

Pro Gly Gly Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Val Ser Leu
35 40 45

Ser Gly Tyr Lys Met Asn Trp Val Arg Gln Ala Pro Gly Lys Gly Leu
50 55 60

Glu Trp Val Ser Ser Ile Thr Gly Met Ser Asn Tyr Ile His Tyr Ser
65 70 75 80

Asp Ser Val Lys Gly Arg Phe Thr Ile Ser Arg Asp Asn Ala Met Asn
85 90 95

Ser Leu Tyr Leu Gln Met Asn Ser Leu Thr Ala Glu Asp Thr Gly Val
100 105 110

Tyr Tyr Cys Ala Thr Gln Pro Gly Glu Leu Ala Pro Phe Asp His Trp
115 120 125

Gly Gln Gly Thr Leu Val Thr Val Ser Ser
130 135

(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 138 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

Met Gly Trp Ser Cys Ile Ile Leu Phe Leu Val Ala Thr Ala Thr Gly
1 5 10 15

Val His Ser Glu Val Gln Leu Val Glu Ser Gly Gly Gly Leu Val Gln
20 25 30

Pro Gly Gly Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Val Ser Leu
35 40 45

Ser Gly Tyr Lys Met Asn Trp Val Arg Gln Ala Pro Gly Lys Gly Leu
50 55 60

Glu Trp Val Ser Ser Ile Thr Gly Met Ser Asn Tyr Ile His Tyr Ser
65 70 75 80

Asp Ser Val Lys Gly Arg Phe Thr Ile Ser Arg Asp Asn Ala Met Asn
85 90 95

Ser Leu Tyr Leu Gln Met Asn Ser Leu Thr Ala Glu Asp Thr Gly Val
100 105 110

Tyr Tyr Cys Ala Thr Gln Pro Gly Glu Leu Ala Pro Phe Asp His Trp
115 120 125

Gly Gln Gly Thr Leu Val Thr Val Ser Ser
130 135

(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 111 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:
Gln Ser Val Leu Thr Gln Pro Pro Ser Val Ser Ala Ala Pro Gly Gln
1 5 10 15

Lys Val Thr Ile Ser Cys Thr Gly Ser Ser Ser Asn Leu Gly Ala Gly
20 25 30

Tyr Asp Val His Trp Tyr Arg Gln Leu Pro Gly Thr Ala Pro Lys Leu
35 40 45

Leu Ile Tyr Asp Asn Asn Asn Arg Pro Ser Gly Val Pro Asp Arg Phe
50 55 60

Ser Gly Ser Lys Ser Gly Pro Ser Ala Ser Leu Ala Ile Ser Gly Leu
65 70 75 80

Gln Ala Glu Asp Glu Ala Asp Tyr Tyr Cys Gln Ser Tyr Asp Ser Ser
85 90 95

Leu Asn Gly Tyr Val Phe Gly Thr Gly Thr Gln Leu Thr Val Leu
100 105 110

(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 90 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:

Gln Ser Val Leu Thr Gln Pro Pro Ser Val Ser Gly Ala Pro Gly Gln
1 5 10 15

Arg Val Thr Ile Ser Cys Thr Gly Ser Ser Ser Asn Ile Gly Ala Gly
20 25 30

Tyr Asp Val His Trp Tyr Gln Gln Leu Pro Gly Thr Ala Pro Lys Leu
35 40 45

Leu Ile Tyr Gly Asn Ser Asn Arg Pro Ser Gly Val Pro Asp Arg Phe
50 55 60

Ser Gly Ser Lys Ser Gly Thr Ser Ala Ser Leu Ala Ile Thr Gly Leu
65 70 75 80

Gln Ala Glu Asp Glu Ala Asp Tyr Tyr Cys
85 90

(2) INFORMATION FOR SEQ ID NO: 11:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 128 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

Met Gly Trp Ser Cys Ile Ile Leu Phe Leu Val Ala Thr Ala Thr Gly
1 5 10 15

Val His Ser Glu Leu Thr Gln Pro Pro Ser Val Ser Gly Ala Pro Gly
20 25 30

Gln Arg Val Thr Ile Ser Cys Thr Gly Ser Ser Ser Asn Ile Gly Ala
35 40 45

Gly Tyr Asp Val His Trp Tyr Gln Gln Leu Pro Gly Thr Ala Pro Lys
50 55 60

Leu Leu Ile Tyr Gly Asn Ser Asn Arg Pro Ser Gly Val Pro Asp Arg
65 70 75 80

Phe Ser Gly Ser Lys Ser Gly Thr Ser Ala Ser Leu Ala Ile Thr Gly
85 90 95

Leu Gln Ala Glu Asp Glu Ala Asp Tyr Tyr Cys Gln Ser Tyr Asp Ser
100 105 110

Ser Leu Asn Gly Tyr Val Phe Gly Thr Gly Thr Gln Leu Thr Val Leu
115 120 125



(2) INFORMATION FOR SEQ ID NO: 12:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 130 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS:
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

Met Gly Trp Ser Cys Ile Ile Leu Phe Leu Val Ala Thr Ala Thr Gly
1 5 10 15

Val His Ser Gln Ser Val Leu Thr Gln Pro Pro Ser Val Ser Gly Ala
20 25 30

Pro Gly Gln Arg Val Thr Ile Ser Cys Thr Gly Ser Ser Ser Asn Ile
35 40 45

Gly Ala Gly Tyr Asp Val His Trp Tyr Gln Gln Leu Pro Gly Thr Ala
50 55 60

Pro Lys Leu Leu Ile Tyr Gly Asn Ser Asn Arg Pro Ser Gly Val Pro
65 70 75 80

Asp Arg Phe Ser Gly Ser Lys Ser Gly Thr Ser Ala Ser Leu Ala Ile
85 90 95

Thr Gly Leu Gln Ala Glu Asp Glu Ala Asp Tyr Tyr Cys Gln Ser Tyr
100 105 110

Asp Ser Ser Leu Asn Gly Tyr Val Phe Gly Thr Gly Thr Gln Leu Thr
115 120 125

Val Leu
130
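
SEQ ID NO: 11 and SEQ ID NO: 12 both begin with the 19-residue leader MGWSCIILFLVATATGVHS shown in the figures and differ in the residues that follow it. The Python sketch below separates the leader from the remainder of each precursor. `split_leader` is a hypothetical helper, and only the first residues of each sequence are included, not the full listing entries.

```python
from typing import Tuple

# Leader shown in Figures 6 and 7 and at the start of SEQ ID NO: 11 and 12.
LEADER = "MGWSCIILFLVATATGVHS"


def split_leader(precursor: str, leader: str = LEADER) -> Tuple[str, str]:
    """Split a precursor into (leader, remainder); fail if the leader is absent."""
    if not precursor.startswith(leader):
        raise ValueError("precursor does not start with the expected leader")
    return leader, precursor[len(leader):]


# Opening residues of SEQ ID NO: 11 and SEQ ID NO: 12 in one-letter code.
SEQ11_START = "MGWSCIILFLVATATGVHSELTQPPSVSGAPGQ"
SEQ12_START = "MGWSCIILFLVATATGVHSQSVLTQPPSVSGAPGQ"

if __name__ == "__main__":
    for name, seq in (("SEQ ID NO: 11", SEQ11_START), ("SEQ ID NO: 12", SEQ12_START)):
        leader, rest = split_leader(seq)
        print(name, len(leader), rest)
```

The two-residue difference after the leader (ELTQ versus QSVLTQ) is consistent with the stated lengths of 128 and 130 amino acids.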

(2) INFORMATION FOR SEQ ID NO: 13:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 6281 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:

GACGTCGCGG CCGCTCTAGG CCTCCAAAAA AGCCTCCTCA CTACTTCTGG AATAGCTCAG 60
AGGCCGAGGC GGCCTCGGCC TCTGCATAAA TAAAAAAAAT TAGTCAGCCA TGCATGGGGC 120
GGAGAATGGG CGGAACTGGG CGGAGTTAGG GGCGGGATGG GCGGAGTTAG GGGCGGGACT 180
ATGGTTGCTG ACTAATTGAG ATGCATGCTT TGCATACTTC TGCCTGCTGG GGAGCCTGGG 240
GACTTTCCAC ACCTGGTTGC TGACTAATTG AGATGCATGC TTTGCATACT TCTGCCTGCT 300
GGGGAGCCTG GGGACTTTCC ACACCCTAAC TGACACACAT TCCACAGAAT TAATTCCCGG 360
GGATCGATCC GTCGACGTAC GACTAGTTAT TAATAGTAAT CAATTACGGG GTCATTAGTT 420
CATAGCCCAT ATATGGAGTT CCGCGTTACA TAACTTACGG TAAATGGCCC GCCTGGCTGA 480
CCGCCCAACG ACCCCCGCCC ATTGACGTCA ATAATGACGT ATGTTCCCAT AGTAACGCCA 540
ATAGGGACTT TCCATTGACG TCAATGGGTG GACTATTTAC GGTAAACTGC CCACTTGGCA 600
GTACATCAAG TGTATCATAT GCCAAGTACG CCCCCTATTG ACGTCAATGA CGGTAAATGG 660
CCCGCCTGGC ATTATGCCCA GTACATGACC TTATGGGACT TTCCTACTTG GCAGTACATC 720
TACGTATTAG TCATCGCTAT TACCATGGTG ATGCGGTTTT GGCAGTACAT CAATGGGCGT 780
GGATAGCGGT TTGACTCACG GGGATTTCCA AGTCTCCACC CCATTGACGT CAATGGGAGT 840
TTGTTTTGGC ACCAAAATCA ACGGGACTTT CCAAAATGTC GTAACAACTC CGCCCCATTG 900
ACGCAAATGG GCGGTAGGCG TGTACGGTGG GAGGTCTATA TAAGCAGAGC TGGGTACGTG 960
AACCGTCAGA TCGCCTGGAG ACGCCATCGA ATTCTGAGCA CACAGGACCT CACCATGGGA 1020
TGGAGCTGTA TCATCCTCTT CTTGGTAGCA ACAGCTACAG GTGTCCACTC CGAGGTCCAA 1080

CTGCTCGAGT CTGGGGGAGG CTTGGTACAG CCTGGGGGGT CCCTGAGACT CTCCTGCGCA 1140

GCCTCTGGAG TCTCCCTCAG TGGATACAAG ATGAACTGGG TCCGCCAGGC TCCAGGGAAG 1200

GGGCTGGAAT GGGTCTCTTC CATTACTGGT ATGAGTAATT ACATACACTA CTCAGACTCA 1260

GTGAAGGGCC GATTCACCAT CTCCAGAGAC AACGCCATGA ACTCACTGTA TCTGCAAATG 1320

AACAGCCTGA CAGCCGAGGA CACGGGTGTT TATTATTGTG CGACACAACC GGGGGAGCTG 1380

GCGCCTTTTG ACCATTGGGG CCAGGGAACC CTGGTCACCG TCTCCTCAGC CTCCACCAAG 1440

GGCCCATCGG TCTTCCCCCT GGCACCCTCC TCCAAGAGCA CCTCTGGGGG CACAGCGGCC 1500

CTGGGCTGCC TGGTCAAGGA CTACTTCCCC GAACCGGTGA CGGTGTCGTG GAACTCAGGC 1560

GCCCTGACCA GCGGCGTGCA CACCTTCCCG GCTGTCCTAC AGTCCTCAGG ACTCTACTCC 1620

CTCAGCAGCG TGGTGACCGT GCCCTCCAGC AGCTTGGGCA CCCAGACCTA CATCTGCAAC 1680

GTGAATCACA AGCCCAGCAA CACCAAGGTG GACAAGAAAG TTGAGCCCAA ATCTTGTGAC 1740

AAAACTCACA CATGCCCACC GTGCCCAGCA CCTGAACTCC TGGGGGGACC GTCAGTCTTC 1800

CTCTTCCCCC CAAAACCCAA GGACACCCTC ATGATCTCCC GGACCCCTGA GGTCACATGC 1860

GTGGTGGTGG ACGTGAGCCA CGAAGACCCT GAGGTCAAGT TCAACTGGTA CGTGGACGGC 1920

GTGGAGGTGC ATAATGCCAA GACAAAGCCG CGGGAGGAGC AGTACAACAG CACGTACCGG 1980

GTGGTCAGCG TCCTCACCGT CCTGCACCAG GACTGGCTGA ATGGCAAGGA GTACAAGTGC 2040

AAGGTCTCCA ACAAAGCCCT CCCAGCCCCC ATCGAGAAAA CCATCTCCAA AGCCAAAGGG 2100

CAGCCCCGAG AACCACAGGT GTACACCCTG CCCCCATCCC GGGATGAGCT GACCAAGAAC 2160

CAGGTCAGCC TGACCTGCCT GGTCAAAGGC TTCTATCCCA GCGACATCGC CGTGGAGTGG 2220

GAGAGCAATG GGCAGCCGGA GAACAACTAC AAGACCACGC CTCCCGTGCT GGACTCCGAC 2280

GGCTCCTTCT TCCTCTACAG CAAGCTCACC GTGGACAAGA GCAGGTGGCA GCAGGGGAAC 2340

GTCTTCTCAT GCTCCGTGAT GCATGAGGCT CTGCACAACC ACTACACGCA GAAGAGCCTC 2400

TCCCTGTCTC CGGGTAAATG ATAGATATCT ACGTATGATC AGCCTCGACT GTGCCTTCTA 2460

GTTGCCAGCC ATCTGTTGTT TGCCCCTCCC CCGTGCCTTC CTTGACCCTG GAAGGTGCCA 2520

CTCCCACTGT CCTTTCCTAA TAAAATGAGG AAATTGCATC GCATTGTCTG AGTAGGTGTC 2580

ATTCTATTCT GGGGGGTGGG GTGGGGCAGG ACAGCAAGGG GGAGGATTGG GAAGACAATA 2640

GCAGGCATGC TGGGGATGCG GTGGGCTCTA TGGAACCAGC TGGGGCTCGA CAGCGCTGGA 2700

TCTCCCGATC CCCAGCTTTG CTTCTCAATT TCTTATTTGC ATAATGAGAA AAAAAGGAAA 2760

ATTAATTTTA ACACCAATTC AGTAGTTGAT TGAGCAAATG CGTTGCCAAA AAGGATGCTT 2820

TAGAGACAGT GTTCTCTGCA CAGATAAGGA CAAACATTAT TCAGAGGGAG TACCCAGAGC 2880

TGAGACTCCT AAGCCAGTGA GTGGCACAGC ATTCTAGGGA GAAATATGCT TGTCATCACC 2940

GAAGCCTGAT TCCGTAGAGC CACACCTTGG TAAGGGCCAA TCTGCTCACA CAGGATAGAG 3000

AGGGCAGGAG CCAGGGCAGA GCATATAAGG TGAGGTAGGA TCAGTTGCTC CTCACATTTG 3060

CTTCTGACAT AGTTGTGTTG GGAGCTTGGA TAGCTTGGAC AGCTCAGGGC TGCGATTTCG 3120

CGCCAAACTT GACGGCAATC CTAGCGTGAA GGCTGGTAGG ATTTTATCCC CGCTGCCATC 3180

ATGGTTCGAC CATTGAACTG CATCGTCGCC GTGTCCCAAA ATATGGGGAT TGGCAAGAAC 3240

GGAGACCTAC CCTGGCCTCC GCTCAGGAAC GAGTTCAAGT ACTTCCAAAG AATGACCACA 3300

ACCTCTTCAG TGGAAGGTAA ACAGAATCTG GTGATTATGG GTAGGAAAAC CTGGTTCTCC 3360

ATTCCTGAGA AGAATCGACC TTTAAAGGAC AGAATTAATA TAGTTCTCAG TAGAGAACTC 3420

AAAGAACCAC CACGAGGAGC TCATTTTCTT GCCAAAAGTT TGGATGATGC CTTAAGACTT 3480

ATTGAACAAC CGGAATTGGC AAGTAAAGTA GACATGGTTT GGATAGTCGG AGGCAGTTCT 3540

GTTTACCAGG AAGCCATGAA TCAACCAGGC CACCTTAGAC TCTTTGTGAC AAGGATCATG 3600

CAGGAATTTG AAAGTGACAC GTTTTTCCCA GAAATTGATT TGGGGAAATA TAAACTTCTC 3660

CCAGAATACC CAGGCGTCCT CTCTGAGGTC CAGGAGGAAA AAGGCATCAA GTATAAGTTT 3720

GAAGTCTACG AGAAGAAAGA CTAACAGGAA GATGCTTTCA AGTTCTCTGC TCCCCTCCTA 3780

AAGCTATGCA TTTTTATAAG ACCATGGGAC TTTTGCTGGC TTTAGATCAG CCTCGACTGT 3840

GCCTTCTAGT TGCCAGCCAT CTGTTGTTTG CCCCTCCCCC GTGCCTTCCT TGACCCTGGA 3900

AGGTGCCACT CCCACTGTCC TTTCCTAATA AAATGAGGAA ATTGCATCGC ATTGTCTGAG 3960

TAGGTGTCAT TCTATTCTGG GGGGTGGGGT GGGGCAGGAC AGCAAGGGGG AGGATTGGGA 4020

AGACAATAGC AGGCATGCTG GGGATGCGGT GGGCTCTATG GAACCAGCTG GGGCTCGATC 4080

GAGTGTATGA CTGCGGCCGC GATCCCGTCG AGAGCTTGGC GTAATCATGG TCATAGCTGT 4140

TTCCTGTGTG AAATTGTTAT CCGCTCACAA TTCCACACAA CATACGAGCC GGAAGCATAA 4200

AGTGTAAAGC CTGGGGTGCC TAATGAGTGA GCTAACTCAC ATTAATTGCG TTGCGCTCAC 4260

TGCCCGCTTT CCAGTCGGGA AACCTGTCGT GCCAGCTGCA TTAATGAATC GGCCAACGCG 4320

CGGGGAGAGG CGGTTTGCGT ATTGGGCGCT CTTCCGCTTC CTCGCTCACT GACTCGCTGC 4380

GCTCGGTCGT TCGGCTGCGG CGAGCGGTAT CAGCTCACTC AAAGGCGGTA ATACGGTTAT 4440

CCACAGAATC AGGGGATAAC GCAGGAAAGA ACATGTGAGC AAAAGGCCAG CAAAAGGCCA 4500

GGAACCGTAA AAAGGCCGCG TTGCTGGCGT TTTTCCATAG GCTCCGCCCC CCTGACGAGC 4560

ATCACAAAAA TCGACGCTCA AGTCAGAGGT GGCGAAACCC GACAGGACTA TAAAGATACC 4620

AGGCGTTTCC CCCTGGAAGC TCCCTCGTGC GCTCTCCTGT TCCGACCCTG CCGCTTACCG 4680

GATACCTGTC CGCCTTTCTC CCTTCGGGAA GCGTGGCGCT TTCTCAATGC TCACGCTGTA 4740

GGTATCTCAG TTCGGTGTAG GTCGTTCGCT CCAAGCTGGG CTGTGTGCAC GAACCCCCCG 4800

TTCAGCCCGA CCGCTGCGCC TTATCCGGTA ACTATCGTCT TGAGTCCAAC CCGGTAAGAC 4860

ACGACTTATC GCCACTGGCA GCAGCCACTG GTAACAGGAT TAGCAGAGCG AGGTATGTAG 4920

GCGGTGCTAC AGAGTTCTTG AAGTGGTGGC CTAACTACGG CTACACTAGA AGGACAGTAT 4980

TTGGTATCTG CGCTCTGCTG AAGCCAGTTA CCTTCGGAAA AAGAGTTGGT AGCTCTTGAT 5040

CCGGCAAACA AACCACCGCT GGTAGCGGTG GTTTTTTTGT TTGCAAGCAG CAGATTACGC 5100

GCAGAAAAAA AGGATCTCAA GAAGATCCTT TGATCTTTTC TACGGGGTCT GACGCTCAGT 5160

GGAACGAAAA CTCACGTTAA GGGATTTTGG TCATGAGATT ATCAAAAAGG ATCTTCACCT 5220

AGATCCTTTT AAATTAAAAA TGAAGTTTTA AATCAATCTA AAGTATATAT GAGTAAACTT 5280

GGTCTGACAG TTACCAATGC TTAATCAGTG AGGCACCTAT CTCAGCGATC TGTCTATTTC 5340

GTTCATCCAT AGTTGCCTGA CTCCCCGTCG TGTAGATAAC TACGATACGG GAGGGCTTAC 5400

CATCTGGCCC CAGTGCTGCA ATGATACCGC GAGACCCACG CTCACCGGCT CCAGATTTAT 5460

CAGCAATAAA CCAGCCAGCC GGAAGGGCCG AGCGCAGAAG TGGTCCTGCA ACTTTATCCG 5520

CCTCCATCCA GTCTATTAAT TGTTGCCGGG AAGCTAGAGT AAGTAGTTCG CCAGTTAATA 5580

GTTTGCGCAA CGTTGTTGCC ATTGCTACAG GCATCGTGGT GTCACGCTCG TCGTTTGGTA 5640

TGGCTTCATT CAGCTCCGGT TCCCAACGAT CAAGGCGAGT TACATGATCC CCCATGTTGT 5700

GCAAAAAAGC GGTTAGCTCC TTCGGTCCTC CGATCGTTGT CAGAAGTAAG TTGGCCGCAG 5760

TGTTATCACT CATGGTTATG GCAGCACTGC ATAATTCTCT TACTGTCATG CCATCCGTAA 5820

GATGCTTTTC TGTGACTGGT GAGTACTCAA CCAAGTCATT CTGAGAATAG TGTATGCGGC 5880

GACCGAGTTG CTCTTGCCCG GCGTCAATAC GGGATAATAC CGCGCCACAT AGCAGAACTT 5940

TAAAAGTGCT CATCATTGGA AAACGTTCTT CGGGGCGAAA ACTCTCAAGG ATCTTACCGC 6000

TGTTGAGATC CAGTTCGATG TAACCCACTC GTGCACCCAA CTGATCTTCA GCATCTTTTA 6060

CTTTCACCAG CGTTTCTGGG TGAGCAAAAA CAGGAAGGCA AAATGCCGCA AAAAAGGGAA 6120

TAAGGGCGAC ACGGAAATGT TGAATACTCA TACTCTTCCT TTTTCAATAT TATTGAAGCA 6180

TTTATCAGGG TTATTGTCTC ATGAGCGGAT ACATATTTGA ATGTATTTAG AAAAATAAAC 6240

AAATAGGGGT TCCGCGCACA TTTCCCCGAA AAGTGCCACC T 6281

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5679 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

GACGTCGCGG CCGCTCTAGG CCTCCAAAAA AGCCTCCTCA CTACTTCTGG AATAGCTCAG 60

AGGCCGAGGC GGCCTCGGCC TCTGCATAAA TAAAAAAAAT TAGTCAGCCA TGCATGGGGC 120

GGAGAATGGG CGGAACTGGG CGGAGTTAGG GGCGGGATGG GCGGAGTTAG GGGCGGGACT 180

ATGGTTGCTG ACTAATTGAG ATGCATGCTT TGCATACTTC TGCCTGCTGG GGAGCCTGGG 240

GACTTTCCAC ACCTGGTTGC TGACTAATTG AGATGCATGC TTTGCATACT TCTGCCTGCT 300

GGGGAGCCTG GGGACTTTCC ACACCCTAAC TGACACACAT TCCACAGAAT TAATTCCCGG 360

GGATCGATCC GTCGACGTAC GACTAGTTAT TAATAGTAAT CAATTACGGG GTCATTAGTT 420

CATAGCCCAT ATATGGAGTT CCGCGTTACA TAACTTACGG TAAATGGCCC GCCTGGCTGA 480

CCGCCCAACG ACCCCCGCCC ATTGACGTCA ATAATGACGT ATGTTCCCAT AGTAACGCCA 540

ATAGGGACTT TCCATTGACG TCAATGGGTG GACTATTTAC GGTAAACTGC CCACTTGGCA 600

GTACATCAAG TGTATCATAT GCCAAGTACG CCCCCTATTG ACGTCAATGA CGGTAAATGG 660

CCCGCCTGGC ATTATGCCCA GTACATGACC TTATGGGACT TTCCTACTTG GCAGTACATC 720

TACGTATTAG TCATCGCTAT TACCATGGTG ATGCGGTTTT GGCAGTACAT CAATGGGCGT 780

GGATAGCGGT TTGACTCACG GGGATTTCCA AGTCTCCACC CCATTGACGT CAATGGGAGT 840

TTGTTTTGGC ACCAAAATCA ACGGGACTTT CCAAAATGTC GTAACAACTC CGCCCCATTG 900

ACGCAAATGG GCGGTAGGCG TGTACGGTGG GAGGTCTATA TAAGCAGAGC TGGGTACGTG 960

AACCGTCAGA TCGCCTGGAG ACGCCATCGA ATTCTGAGCA CACAGGACCT CACCATGGGA 1020

TGGAGCTGTA TCATCCTCTT CTTGGTAGCA ACAGCTACAG GTGTCCACTC CGAGCTCACG 1080

CAGCCGCCCT CAGTCTCTGC GGCCCCAGGA CAGAAGGTCA CCATCTCCTG CACTGGGAGC 1140

AGCTCCAACC TCGGGGCAGG TTATGATGTT CACTGGTACC GGCAACTTCC AGGGACAGCC 1200

CCCAAACTCC TCATCTATGA TAACAACAAT CGGCCCTCAG GGGTCCCTGA CCGATTCTCT 1260

GGCTCCAAGT CTGGCCCCTC AGCCTCCCTG GCCATCTCTG GGCTCCAGGC TGAGGATGAG 1320

GCTGATTATT ACTGCCAGTC CTATGACAGC AGCCTGAATG GTTATGTCTT CGGAACTGGG 1380

ACCCAGCTCA CCGTCCTAGG TCAGCCCAAG GCTGCCCCCT CGGTCACTCT GTTCCCGCCC 1440

TCCTCTGAGG AGCTTCAAGC CAACAAGGCC ACACTGGTGT GTCTCATAAG TGACTTCTAC 1500

CCGGGAGCCG TGACAGTGGC CTGGAAGGCA ATTAGCAGCC CCGTCAAGGC GGGAGTGGAG 1560

ACCACCACAC CCTCCAAACA AAGCAACAAC AAGTACGCGG CCAGCAGCTA TCTGAGCCTG 1620

ACGCCTGAGC AGTGGAAGTC CCACAGAAGG TACAGCTGCC AGGTCACGCA TGAAGGGAGC 1680

ACCGTGGAGA AGACAGTGGC CCCTACAGAA TGTTCATAGT TCTAGATCTA CGTATGATCA 1740

GCCTCGACTG TGCCTTCTAG TTGCCAGCCA TCTGTTGTTT GCCCCTCCCC CGTGCCTTCC 1800

TTGACCCTGG AAGGTGCCAC TCCCACTGTC CTTTCCTAAT AAAATGAGGA AATTGCATCG 1860

CATTGTCTGA GTAGGTGTCA TTCTATTCTG GGGGGTGGGG TGGGGCAGGA CAGCAAGGGG 1920

GAGGATTGGG AAGACAATAG CAGGCATGCT GGGGATGCGG TGGGCTCTAT GGAACCAGCT 1980

GGGGCTCGAC AGCTCGAGCT AGCTTTGCTT CTCAATTTCT TATTTGCATA ATGAGAAAAA 2040

AAGGAAAATT AATTTTAACA CCAATTCAGT AGTTGATTGA GCAAATGCGT TGCCAAAAAG 2100

GATGCTTTAG AGACAGTGTT CTCTGCACAG ATAAGGACAA ACATTATTCA GAGGGAGTAC 2160

CCAGAGCTGA GACTCCTAAG CCAGTGAGTG GCACAGCATT CTAGGGAGAA ATATGCTTGT 2220

CATCACCGAA GCCTGATTCC GTAGAGCCAC ACCTTGGTAA GGGCCAATCT GCTCACACAG 2280

GATAGAGAGG GCAGGAGCCA GGGCAGAGCA TATAAGGTGA GGTAGGATCA GTTGCTCCTC 2340

ACATTTGCTT CTGACATAGT TGTGTTGGGA GCTTGGATCG ATCCACCATG GTTGAACAAG 2400

ATGGATTGCA CGCAGGTTCT CCGGCCGCTT GGGTGGAGAG GCTATTCGGC TATGACTGGG 2460

CACAACAGAC AATCGGCTGC TCTGATGCCG CCGTGTTCCG GCTGTCAGCG CAGGGGCGCC 2520

CGGTTCTTTT TGTCAAGACC GACCTGTCCG GTGCCCTGAA TGAACTGCAG GACGAGGCAG 2580

CGCGGCTATC GTGGCTGGCC ACGACGGGCG TTCCTTGCGC AGCTGTGCTC GACGTTGTCA 2640

CTGAAGCGGG AAGGGACTGG CTGCTATTGG GCGAAGTGCC GGGGCAGGAT CTCCTGTCAT 2700

CTCACCTTGC TCCTGCCGAG AAAGTATCCA TCATGGCTGA TGCAATGCGG CGGCTGCATA 2760

CGCTTGATCC GGCTACCTGC CCATTCGACC ACCAAGCGAA ACATCGCATC GAGCGAGCAC 2820

GTACTCGGAT GGAAGCCGGT CTTGTCGATC AGGATGATCT GGACGAAGAG CATCAGGGGC 2880

TCGCGCCAGC CGAACTGTTC GCCAGGCTCA AGGCGCGCAT GCCCGACGGC GAGGATCTCG 2940

TCGTGACCCA TGGCGATGCC TGCTTGCCGA ATATCATGGT GGAAAATGGC CGCTTTTCTG 3000

GATTCATCGA CTGTGGCCGG CTGGGTGTGG CGGACCGCTA TCAGGACATA GCGTTGGCTA 3060

CCCGTGATAT TGCTGAAGAG CTTGGCGGCG AATGGGCTGA CCGCTTCCTC GTGCTTTACG 3120

GTATCGCCGC TCCCGATTCG CAGCGCATCG CCTTCTATCG CCTTCTTGAC GAGTTCTTCT 3180

GAGCGGGACT CTGGGGTTCG AAATGACCGA CCAAGCGACG CCCAACCTGC CATCACGAGA 3240

TTTCGATTCC ACCGCCGCCT TCTATGAAAG GTTGGGCTTC GGAATCGTTT TCCGGGACGC 3300

CGGCTGGATG ATCCTCCAGC GCGGGGATCT CATGCTGGAG TTCTTCGCCC ACCCCAACTT 3360

GTTTATTGCA GCTTATAATG GTTACAAATA AAGCAATAGC ATCACAAATT TCACAAATAA 3420

AGCATTTTTT TCACTGCATT CTAGTTGTGG TTTGTCCAAA CTCATCAATG TATCTTATCA 3480

TGTCTGGATC GCGGCCGCGA TCCCGTCGAG AGCTTGGCGT AATCATGGTC ATAGCTGTTT 3540

CCTGTGTGAA ATTGTTATCC GCTCACAATT CCACACAACA TACGAGCCGG AAGCATAAAG 3600

TGTAAAGCCT GGGGTGCCTA ATGAGTGAGC TAACTCACAT TAATTGCGTT GCGCTCACTG 3660

CCCGCTTTCC AGTCGGGAAA CCTGTCGTGC CAGCTGCATT AATGAATCGG CCAACGCGCG 3720

GGGAGAGGCG GTTTGCGTAT TGGGCGCTCT TCCGCTTCCT CGCTCACTGA CTCGCTGCGC 3780

TCGGTCGTTC GGCTGCGGCG AGCGGTATCA GCTCACTCAA AGGCGGTAAT ACGGTTATCC 3840

ACAGAATCAG GGGATAACGC AGGAAAGAAC ATGTGAGCAA AAGGCCAGCA AAAGGCCAGG 3900

AACCGTAAAA AGGCCGCGTT GCTGGCGTTT TTCCATAGGC TCCGCCCCCC TGACGAGCAT 3960

CACAAAAATC GACGCTCAAG TCAGAGGTGG CGAAACCCGA CAGGACTATA AAGATACCAG 4020

GCGTTTCCCC CTGGAAGCTC CCTCGTGCGC TCTCCTGTTC CGACCCTGCC GCTTACCGGA 4080

TACCTGTCCG CCTTTCTCCC TTCGGGAAGC GTGGCGCTTT CTCAATGCTC ACGCTGTAGG 4140

TATCTCAGTT CGGTGTAGGT CGTTCGCTCC AAGCTGGGCT GTGTGCACGA ACCCCCCGTT 4200

CAGCCCGACC GCTGCGCCTT ATCCGGTAAC TATCGTCTTG AGTCCAACCC GGTAAGACAC 4260

GACTTATCGC CACTGGCAGC AGCCACTGGT AACAGGATTA GCAGAGCGAG GTATGTAGGC 4320

GGTGCTACAG AGTTCTTGAA GTGGTGGCCT AACTACGGCT ACACTAGAAG GACAGTATTT 4380

GGTATCTGCG CTCTGCTGAA GCCAGTTACC TTCGGAAAAA GAGTTGGTAG CTCTTGATCC 4440

GGCAAACAAA CCACCGCTGG TAGCGGTGGT TTTTTTGTTT GCAAGCAGCA GATTACGCGC 4500

AGAAAAAAAG GATCTCAAGA AGATCCTTTG ATCTTTTCTA CGGGGTCTGA CGCTCAGTGG 4560

AACGAAAACT CACGTTAAGG GATTTTGGTC ATGAGATTAT CAAAAAGGAT CTTCACCTAG 4620

ATCCTTTTAA ATTAAAAATG AAGTTTTAAA TCAATCTAAA GTATATATGA GTAAACTTGG 4680

TCTGACAGTT ACCAATGCTT AATCAGTGAG GCACCTATCT CAGCGATCTG TCTATTTCGT 4740

TCATCCATAG TTGCCTGACT CCCCGTCGTG TAGATAACTA CGATACGGGA GGGCTTACCA 4800

TCTGGCCCCA GTGCTGCAAT GATACCGCGA GACCCACGCT CACCGGCTCC AGATTTATCA 4860

GCAATAAACC AGCCAGCCGG AAGGGCCGAG CGCAGAAGTG GTCCTGCAAC TTTATCCGCC 4920

TCCATCCAGT CTATTAATTG TTGCCGGGAA GCTAGAGTAA GTAGTTCGCC AGTTAATAGT 4980

TTGCGCAACG TTGTTGCCAT TGCTACAGGC ATCGTGGTGT CACGCTCGTC GTTTGGTATG 5040

GCTTCATTCA GCTCCGGTTC CCAACGATCA AGGCGAGTTA CATGATCCCC CATGTTGTGC 5100

AAAAAAGCGG TTAGCTCCTT CGGTCCTCCG ATCGTTGTCA GAAGTAAGTT GGCCGCAGTG 5160

TTATCACTCA TGGTTATGGC AGCACTGCAT AATTCTCTTA CTGTCATGCC ATCCGTAAGA 5220

TGCTTTTCTG TGACTGGTGA GTACTCAACC AAGTCATTCT GAGAATAGTG TATGCGGCGA 5280

CCGAGTTGCT CTTGCCCGGC GTCAATACGG GATAATACCG CGCCACATAG CAGAACTTTA 5340

AAAGTGCTCA TCATTGGAAA ACGTTCTTCG GGGCGAAAAC TCTCAAGGAT CTTACCGCTG 5400

TTGAGATCCA GTTCGATGTA ACCCACTCGT GCACCCAACT GATCTTCAGC ATCTTTTACT 5460

TTCACCAGCG TTTCTGGGTG AGCAAAAACA GGAAGGCAAA ATGCCGCAAA AAAGGGAATA 5520

AGGGCGACAC GGAAATGTTG AATACTCATA CTCTTCCTTT TTCAATATTA TTGAAGCATT 5580

TATCAGGGTT ATTGTCTCAT GAGCGGATAC ATATTTGAAT GTATTTAGAA AAATAAACAA 5640

ATAGGGGTTC CGCGCACATT TCCCCGAAAA GTGCCACCT 5679

(2) INFORMATION FOR SEQ ID NO: 15:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1442 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

GAATTCTGAG CACACAGGAC CTCACCATGG GATGGAGCTG TATCATCCTC TTCTTGGTAG 60

CAACAGCTAC AGGTGTCCAC TCCGAGGTGC AGCTGGTGGA GTCTGGGGGA GGCTTGGTAC 120

AGCCTGGGGG GTCCCTGAGA CTCTCCTGCG CAGCCTCTGG AGTCTCCCTC AGTGGATACA 180

AGATGAACTG GGTCCGCCAG GCTCCAGGGA AGGGGCTGGA ATGGGTCTCT TCCATTACTG 240

GTATGAGTAA TTACATACAC TACTCAGACT CAGTGAAGGG CCGATTCACC ATCTCCAGAG 300

ACAACGCCAT GAACTCACTG TATCTGCAAA TGAACAGCCT GACAGCCGAG GACACGGGTG 360

TTTATTATTG TGCGACACAA CCGGGGGAGC TGGCGCCTTT TGACCATTGG GGCCAGGGAA 420

CCCTGGTCAC CGTCTCCTCA GCCTCCACCA AGGGCCCATC GGTCTTCCCC CTGGCACCCT 480

CCTCCAAGAG CACCTCTGGG GGCACAGCGG CCCTGGGCTG CCTGGTCAAG GACTACTTCC 540

CCGAACCGGT GACGGTGTCG TGGAACTCAG GCGCCCTGAC CAGCGGCGTG CACACCTTCC 600

CGGCTGTCCT ACAGTCCTCA GGACTCTACT CCCTCAGCAG CGTGGTGACC GTGCCCTCCA 660

GCAGCTTGGG CACCCAGACC TACATCTGCA ACGTGAATCA CAAGCCCAGC AACACCAAGG 720

TGGACAAGAA AGTTGAGCCC AAATCTTGTG ACAAAACTCA CACATGCCCA CCGTGCCCAG 780

CACCTGAACT CCTGGGGGGA CCGTCAGTCT TCCTCTTCCC CCCAAAACCC AAGGACACCC 840

TCATGATCTC CCGGACCCCT GAGGTCACAT GCGTGGTGGT GGACGTGAGC CACGAAGACC 900

CTGAGGTCAA GTTCAACTGG TACGTGGACG GCGTGGAGGT GCATAATGCC AAGACAAAGC 960

CGCGGGAGGA GCAGTACAAC AGCACGTACC GGGTGGTCAG CGTCCTCACC GTCCTGCACC 1020

AGGACTGGCT GAATGGCAAG GAGTACAAGT GCAAGGTCTC CAACAAAGCC CTCCCAGCCC 1080

CCATCGAGAA AACCATCTCC AAAGCCAAAG GGCAGCCCCG AGAACCACAG GTGTACACCC 1140

TGCCCCCATC CCGGGATGAG CTGACCAAGA ACCAGGTCAG CCTGACCTGC CTGGTCAAAG 1200

GCTTCTATCC CAGCGACATC GCCGTGGAGT GGGAGAGCAA TGGGCAGCCG GAGAACAACT 1260

ACAAGACCAC GCCTCCCGTG CTGGACTCCG ACGGCTCCTT CTTCCTCTAC AGCAAGCTCA 1320

CCGTGGACAA GAGCAGGTGG CAGCAGGGGA ACGTCTTCTC ATGCTCCGTG ATGCATGAGG 1380

CTCTGCACAA CCACTACACG CAGAAGAGCC TCTCCCTGTC TCCGGGTAAA TGATAGATAT 1440

CT 1442

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 762 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

GAATTCTGAG CACACAGGAC CTCACCATGG GATGGAGCTG TATCATCCTC TTCTTGGTAG 60

CAACAGCTAC AGGTGTCCAC TCCCAGTCTG TGTTGACGCA GCCGCCCTCA GTCTCTGCGG 120

CCCCAGGACA GAAGGTCACC ATCTCCTGCA CTGGGAGCAG CTCCAACCTC GGGGCAGGTT 180

ATGATGTTCA CTGGTACCGG CAACTTCCAG GGACAGCCCC CAAACTCCTC ATCTATGATA 240

ACAACAATCG GCCCTCAGGG GTCCCTGACC GATTCTCTGG CTCCAAGTCT GGCCCCTCAG 300

CCTCCCTGGC CATCTCTGGG CTCCAGGCTG AGGATGAGGC TGATTATTAC TGCCAGTCCT 360

ATGACAGCAG CCTGAATGGT TATGTCTTCG GAACTGGGAC CCAGCTCACC GTCCTAGGTC 420

AGCCCAAGGC TGCCCCCTCG GTCACTCTGT TCCCGCCCTC CTCTGAGGAG CTTCAAGCCA 480

ACAAGGCCAC ACTGGTGTGT CTCATAAGTG ACTTCTACCC GGGAGCCGTG ACAGTGGCCT 540

GGAAGGCAAT TAGCAGCCCC GTCAAGGCGG GAGTGGAGAC CACCACACCC TCCAAACAAA 600

GCAACAACAA GTACGCGGCC AGCAGCTATC TGAGCCTGAC GCCTGAGCAG TGGAAGTCCC 660

ACAGAAGGTA CAGCTGCCAG GTCACGCATG AAGGGAGCAC CGTGGAGAAG ACAGTGGCCC 720

CTACAGAATG TTCATAGTTC TAGATCTACG TATGATCAGC CT 762

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:

Glu Val Gln Leu Leu Glu
1 5

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:

Glu Val Gln Leu Val Glu
1 5

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1899 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 14..1735



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

GGGGCAAATA ACA ATG GAG TTG CTA ATC CTC AAA GCA AAT GCA ATT ACC 49
               Met Glu Leu Leu Ile Leu Lys Ala Asn Ala Ile Thr
1 5 10

ACA ATC CTC ACT GCA GTC ACA TTT TGT TTT GCT TCT GGT CAA AAC ATC 97
Thr Ile Leu Thr Ala Val Thr Phe Cys Phe Ala Ser Gly Gln Asn Ile
15 20 25

ACT GAA GAA TTT TAT CAA TCA ACA TGC AGT GCA GTT AGC AAA GGC TAT 145
Thr Glu Glu Phe Tyr Gln Ser Thr Cys Ser Ala Val Ser Lys Gly Tyr
30 35 40

CTT AGT GCT CTG AGA ACT GGT TGG TAT ACC AGT GTT ATA ACT ATA GAA 193
Leu Ser Ala Leu Arg Thr Gly Trp Tyr Thr Ser Val Ile Thr Ile Glu
45 50 55 60

TTA AGT AAT ATC AAG GAA AAT AAG TGT AAT GGA ACA GAT GCT AAG GTA 241
Leu Ser Asn Ile Lys Glu Asn Lys Cys Asn Gly Thr Asp Ala Lys Val
65 70 75

AAA TTG ATA AAA CAA GAA TTA GAT AAA TAT AAA AAT GCT GTA ACA GAA 289
Lys Leu Ile Lys Gln Glu Leu Asp Lys Tyr Lys Asn Ala Val Thr Glu
80 85 90

TTG CAG TTG CTC ATG CAA AGC ACA CCA CCA ACA AAC AAT CGA GCC AGA 337
Leu Gln Leu Leu Met Gln Ser Thr Pro Pro Thr Asn Asn Arg Ala Arg
95 100 105

AGA GAA CTA CCA AGG TTT ATG AAT TAT ACA CTC AAC AAT GCC AAA AAA 385
Arg Glu Leu Pro Arg Phe Met Asn Tyr Thr Leu Asn Asn Ala Lys Lys
110 115 120

ACC AAT GTA ACA TTA AGC AAG AAA AGG AAA AGA AGA TTT CTT GGT TTT 433
Thr Asn Val Thr Leu Ser Lys Lys Arg Lys Arg Arg Phe Leu Gly Phe
125 130 135 140

TTG TTA GGT GTT GGA TCT GCA ATC GCC AGT GGC GTT GCT GTA TCT AAG 481
Leu Leu Gly Val Gly Ser Ala Ile Ala Ser Gly Val Ala Val Ser Lys
145 150 155

GTC CTG CAC CTA GAA GGG GAA GTG AAC AAG ATC AAA AGT GCT CTA CTA 529
Val Leu His Leu Glu Gly Glu Val Asn Lys Ile Lys Ser Ala Leu Leu
160 165 170

TCC ACA AAC AAG GCT GTA GTC AGC TTA TCA AAT GGA GTT AGT GTC TTA 577
Ser Thr Asn Lys Ala Val Val Ser Leu Ser Asn Gly Val Ser Val Leu
175 180 185

ACC AGC AAA GTG TTA GAC CTC AAA AAC TAT ATA GAT AAA CAA TTG TTA 625
Thr Ser Lys Val Leu Asp Leu Lys Asn Tyr Ile Asp Lys Gln Leu Leu
190 195 200

CCT ATT GTG AAC AAG CAA AGC TGC AGC ATA TCA AAT ATA GAA ACT GTG 673
Pro Ile Val Asn Lys Gln Ser Cys Ser Ile Ser Asn Ile Glu Thr Val
205 210 215 220

ATA GAG TTC CAA CAA AAG AAC AAC AGA CTA CTA GAG ATT ACC AGG GAA 721
Ile Glu Phe Gln Gln Lys Asn Asn Arg Leu Leu Glu Ile Thr Arg Glu
225 230 235

TTT AGT GTT AAT GCA GGT GTA ACT ACA CCT GTA AGC ACT TAC ATG TTA 769
Phe Ser Val Asn Ala Gly Val Thr Thr Pro Val Ser Thr Tyr Met Leu
240 245 250

ACT AAT AGT GAA TTA TTG TCA TTA ATC AAT GAT ATG CCT ATA ACA AAT 817
Thr Asn Ser Glu Leu Leu Ser Leu Ile Asn Asp Met Pro Ile Thr Asn
255 260 265

GAT CAG AAA AAG TTA ATG TCC AAC AAT GTT CAA ATA GTT AGA CAG CAA 865
Asp Gln Lys Lys Leu Met Ser Asn Asn Val Gln Ile Val Arg Gln Gln
270 275 280

AGT TAC TCT ATC ATG TCC ATA ATA AAA GAG GAA GTC TTA GCA TAT GTA 913
Ser Tyr Ser Ile Met Ser Ile Ile Lys Glu Glu Val Leu Ala Tyr Val
285 290 295 300

GTA CAA TTA CCA CTA TAT GGT GTT ATA GAT ACA CCC TGT TGG AAA CTA 961
Val Gln Leu Pro Leu Tyr Gly Val Ile Asp Thr Pro Cys Trp Lys Leu
305 310 315

CAC ACA TCC CCT CTA TGT ACA ACC AAC ACA AAA GAA GGG TCC AAC ATC 1009
His Thr Ser Pro Leu Cys Thr Thr Asn Thr Lys Glu Gly Ser Asn Ile
320 325 330

TGT TTA ACA AGA ACT GAC AGA GGA TGG TAC TGT GAC AAT GCA GGA TCA 1057
Cys Leu Thr Arg Thr Asp Arg Gly Trp Tyr Cys Asp Asn Ala Gly Ser
335 340 345

GTA TCT TTC TTC CCA CAA GCT GAA ACA TGT AAA GTT CAA TCA AAT CGA 1105
Val Ser Phe Phe Pro Gln Ala Glu Thr Cys Lys Val Gln Ser Asn Arg
350 355 360

GTA TTT TGT GAC ACA ATG AAC AGT TTA ACA TTA CCA AGT GAA ATA AAT 1153
Val Phe Cys Asp Thr Met Asn Ser Leu Thr Leu Pro Ser Glu Ile Asn
365 370 375 380

CTC TGC AAT GTT GAC ATA TTC AAC CCC AAA TAT GAT TGT AAA ATT ATG 1201
Leu Cys Asn Val Asp Ile Phe Asn Pro Lys Tyr Asp Cys Lys Ile Met
385 390 395

ACT TCA AAA ACA GAT GTA AGC AGC TCC GTT ATC ACA TCT CTA GGA GCC 1249
Thr Ser Lys Thr Asp Val Ser Ser Ser Val Ile Thr Ser Leu Gly Ala
400 405 410

ATT GTG TCA TGC TAT GGC AAA ACT AAA TGT ACA GCA TCC AAT AAA AAT 1297
Ile Val Ser Cys Tyr Gly Lys Thr Lys Cys Thr Ala Ser Asn Lys Asn
415 420 425

CGT GGA ATC ATA AAG ACA TTT TCT AAC GGG TGC GAT TAT GTA TCA AAT 1345
Arg Gly Ile Ile Lys Thr Phe Ser Asn Gly Cys Asp Tyr Val Ser Asn
430 435 440

AAA GGG ATG GAC ACT GTG TCT GTA GGT AAC ACA TTA TAT TAT GTA AAT 1393
Lys Gly Met Asp Thr Val Ser Val Gly Asn Thr Leu Tyr Tyr Val Asn
445 450 455 460

AAG CAA GAA GGT AAA AGT CTC TAT GTA AAA GGT GAA CCA ATA ATA AAT 1441
Lys Gln Glu Gly Lys Ser Leu Tyr Val Lys Gly Glu Pro Ile Ile Asn
465 470 475

TTC TAT GAC CCA TTA GTA TTC CCC TCT GAT GAA TTT GAT GCA TCA ATA 1489
Phe Tyr Asp Pro Leu Val Phe Pro Ser Asp Glu Phe Asp Ala Ser Ile
480 485 490

TCT CAA GTC AAC GAG AAG ATT AAC CAG AGC CTA GCA TTT ATT CGT AAA 1537
Ser Gln Val Asn Glu Lys Ile Asn Gln Ser Leu Ala Phe Ile Arg Lys
495 500 505

TCC GAT GAA TTA TTA CAT AAT GTA AAT GCT GGT AAA TCC ACC ACA AAT 1585
Ser Asp Glu Leu Leu His Asn Val Asn Ala Gly Lys Ser Thr Thr Asn
510 515 520

ATC ATG ATA ACT ACT ATA ATT ATA GTG ATT ATA GTA ATA TTG TTA TCA 1633
Ile Met Ile Thr Thr Ile Ile Ile Val Ile Ile Val Ile Leu Leu Ser
525 530 535 540

TTA ATT GCT GTT GGA CTG CTC TTA TAC TGT AAG GCC AGA AGC ACA CCA 1681
Leu Ile Ala Val Gly Leu Leu Leu Tyr Cys Lys Ala Arg Ser Thr Pro
545 550 555

GTC ACA CTA AGC AAA GAT CAA CTG AGT GGT ATA AAT AAT ATT GCA TTT 1729
Val Thr Leu Ser Lys Asp Gln Leu Ser Gly Ile Asn Asn Ile Ala Phe
560 565 570

AGT AAC TAAATAAAAA TAGCACCTAA TCATGTTCTT ACAATGGTTT ACTATCTGCT 1785
Ser Asn

CATAGACAAC CCATCTGTCA TTGGATTTTC TTAAAATCTG AACTTCATCG AAACTCTCAT 1845

CTATAAACCA TCTCACTTAC ACTATTTAAG TAGATTCCTA GTTTATAGTT ATAT 1899
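
The CDS feature given for SEQ ID NO: 19 (LOCATION: 14..1735) can be checked against the 574-residue length stated for SEQ ID NO: 20 with a short calculation. The following is only an illustrative sketch and is not part of the listing as filed: the function names `cds_residue_count` and `translate` are hypothetical, and use of the standard genetic code is an assumption, since the listing does not name a translation table.

```python
from itertools import product

# Standard genetic code in TCAG order. Assumption: the listing does not name a
# translation table; the standard code is used here for illustration only.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# One-letter to three-letter residue names ("Ter" marks a stop codon).
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "Ter",
}


def cds_residue_count(start, end):
    """Return the number of codons in a 1-based, inclusive CDS range."""
    length = end - start + 1
    if length % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return length // 3


def translate(dna):
    """Translate DNA (spaces ignored) into a list of three-letter residues."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # drop any incomplete trailing codon
    return [THREE_LETTER[CODON_TABLE[dna[i:i + 3]]] for i in range(0, usable, 3)]


if __name__ == "__main__":
    # CDS 14..1735 spans 1722 nucleotides, i.e. 574 codons.
    print(cds_residue_count(14, 1735))
    # First twelve codons of the CDS in SEQ ID NO: 19 (positions 14 to 49).
    print(" ".join(translate("ATG GAG TTG CTA ATC CTC AAA GCA AAT GCA ATT ACC")))
```

Run as a script, this prints 574 and then Met Glu Leu Leu Ile Leu Lys Ala Asn Ala Ile Thr, which agrees with the stated protein length and with the first twelve residues shown under the nucleotide sequence.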



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 574 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:

Met Glu Leu Leu Ile Leu Lys Ala Asn Ala Ile Thr Thr Ile Leu Thr
1 5 10 15

Ala Val Thr Phe Cys Phe Ala Ser Gly Gln Asn Ile Thr Glu Glu Phe
20 25 30

Tyr Gln Ser Thr Cys Ser Ala Val Ser Lys Gly Tyr Leu Ser Ala Leu
35 40 45

Arg Thr Gly Trp Tyr Thr Ser Val Ile Thr Ile Glu Leu Ser Asn Ile
50 55 60

Lys Glu Asn Lys Cys Asn Gly Thr Asp Ala Lys Val Lys Leu Ile Lys
65 70 75 80

Gln Glu Leu Asp Lys Tyr Lys Asn Ala Val Thr Glu Leu Gln Leu Leu
85 90 95

Met Gln Ser Thr Pro Pro Thr Asn Asn Arg Ala Arg Arg Glu Leu Pro
100 105 110

Arg Phe Met Asn Tyr Thr Leu Asn Asn Ala Lys Lys Thr Asn Val Thr
115 120 125

Leu Ser Lys Lys Arg Lys Arg Arg Phe Leu Gly Phe Leu Leu Gly Val
130 135 140

Gly Ser Ala Ile Ala Ser Gly Val Ala Val Ser Lys Val Leu His Leu
145 150 155 160

Glu Gly Glu Val Asn Lys Ile Lys Ser Ala Leu Leu Ser Thr Asn Lys
165 170 175

Ala Val Val Ser Leu Ser Asn Gly Val Ser Val Leu Thr Ser Lys Val
180 185 190

Leu Asp Leu Lys Asn Tyr Ile Asp Lys Gln Leu Leu Pro Ile Val Asn
195 200 205

Lys Gln Ser Cys Ser Ile Ser Asn Ile Glu Thr Val Ile Glu Phe Gln
210 215 220

Gln Lys Asn Asn Arg Leu Leu Glu Ile Thr Arg Glu Phe Ser Val Asn
225 230 235 240

Ala Gly Val Thr Thr Pro Val Ser Thr Tyr Met Leu Thr Asn Ser Glu
245 250 255

Leu Leu Ser Leu Ile Asn Asp Met Pro Ile Thr Asn Asp Gln Lys Lys
260 265 270

Leu Met Ser Asn Asn Val Gln Ile Val Arg Gln Gln Ser Tyr Ser Ile
275 280 285

Met Ser Ile Ile Lys Glu Glu Val Leu Ala Tyr Val Val Gln Leu Pro
290 295 300

Leu Tyr Gly Val Ile Asp Thr Pro Cys Trp Lys Leu His Thr Ser Pro
305 310 315 320

Leu Cys Thr Thr Asn Thr Lys Glu Gly Ser Asn Ile Cys Leu Thr Arg
325 330 335

Thr Asp Arg Gly Trp Tyr Cys Asp Asn Ala Gly Ser Val Ser Phe Phe
340 345 350

Pro Gln Ala Glu Thr Cys Lys Val Gln Ser Asn Arg Val Phe Cys Asp
355 360 365

Thr Met Asn Ser Leu Thr Leu Pro Ser Glu Ile Asn Leu Cys Asn Val
370 375 380

Asp Ile Phe Asn Pro Lys Tyr Asp Cys Lys Ile Met Thr Ser Lys Thr
385 390 395 400

Asp Val Ser Ser Ser Val Ile Thr Ser Leu Gly Ala Ile Val Ser Cys
405 410 415

Tyr Gly Lys Thr Lys Cys Thr Ala Ser Asn Lys Asn Arg Gly Ile Ile
420 425 430

Lys Thr Phe Ser Asn Gly Cys Asp Tyr Val Ser Asn Lys Gly Met Asp
435 440 445

Thr Val Ser Val Gly Asn Thr Leu Tyr Tyr Val Asn Lys Gln Glu Gly
450 455 460

Lys Ser Leu Tyr Val Lys Gly Glu Pro Ile Ile Asn Phe Tyr Asp Pro
465 470 475 480

Leu Val Phe Pro Ser Asp Glu Phe Asp Ala Ser Ile Ser Gln Val Asn
485 490 495

Glu Lys Ile Asn Gln Ser Leu Ala Phe Ile Arg Lys Ser Asp Glu Leu
500 505 510

Leu His Asn Val Asn Ala Gly Lys Ser Thr Thr Asn Ile Met Ile Thr
515 520 525

Thr Ile Ile Ile Val Ile Ile Val Ile Leu Leu Ser Leu Ile Ala Val
530 535 540

Gly Leu Leu Leu Tyr Cys Lys Ala Arg Ser Thr Pro Val Thr Leu Ser
545 550 555 560

Lys Asp Gln Leu Ser Gly Ile Asn Asn Ile Ala Phe Ser Asn
565 570

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS:

(D) TOPOLOGY: unknown

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:

Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser
1 5 10 15